LACUS FORUM XLI


Pragmatics, Cognition, and Culture 







LACUS FORUM 


Journal of the Linguistic Association 
of Canada and the United States 


Jessica de Villiers 
The University of British Columbia 
397 - 1873 East Mall 


Vancouver, BC Canada V6T 1Z1 


Douglas W. Coleman, Webmaster 


University of Toledo 
3638 Shamrock Drive 
Toledo, OH 43615, U.S.A. 


Aroldo de Andrade 
Elissa Asp 
Mahmood Reza Atai 
Keith Bateson 

Carey Benom 
Jordan Boschman 
Lucas van Buuren 
Douglas W. Coleman 
Michael Cummings 
Kara van Dam 
Eftychia Damaskou 
Grazyna Drzazga 
Connie Eble 


Sheila Embleton 
Susan Feltman 
Jonathan Fine 
Adolfo Garcia 
Claire Gilchrist 
Tai-Chun (Grace) Hou 
Christina Keppie 
Michael D. Kliffer 
Sydney M. Lamb 
Shumin Lin 
Daniel S. Mailman 
Karen Malcolm 
Andrea Mates 


Editors of Volume XLI 


William Sullivan 


Maria Curie-Skłodowska University 
Pl. M. Curie-Skłodowskiej 5, 20-031 


Lublin, Poland 


Renison J. Gonsalves, Production Editor 


Brooklyn College 
2900 Bedford Ave. 


Brooklyn, NY 11210, U.S.A. 


Reviewers of Volume XLI 


Alan K. Melby 

Valeria Molnar 

Keiko Nakamura 
Jacqueline Nenchin 
Robert Orr 

Kellen Parker van Dam 
Suzanne Quay 

H. Stephen Straight 
William J. Sullivan 
Sarah Tsiang 

Jessica de Villiers 

S. Alexandria Wolochuk 
Hui Yin 


LACUS Forum is published by the Linguistic Association of Canada and the United States. 
Copyright © LACUS, 2020. All rights reserved. 


Printed in U.S.A. 


ISSN 0195-377X 


VOLUME 41 NUMBER 1, 2020 
CONTENTS 
PREFACE V 
Jessica de Villiers, William J. Sullivan, & Dan Mailman 

1. Winner of the 2014 Presidents’ Prize 
AN ANSWER TO Y. R. CHAO’S PROBLEM OF AFFRICATE DISTRIBUTION IN WU 1 


Kellen Parker van Dam 


2. EFL IN SENIOR HIGH SCHOOLS: THE CASE OF TAIWAN 12 
Ying-ying Cheng, Seiji Shibata, Ching-chung Guey, 
Paul W. Mathews, & Katsumi Kiyonaga 


3. THE DECAY OF TOPICALIZATION FROM OLD TO MODERN SPANISH: 31 
ANALYZING HYPOTHESES WITH A CORPUS BASED STUDY 
Aroldo de Andrade 
4. THE PICNIC LANGUAGE 43 
Christina Keppie 
5. THE RHETORICAL FUNCTIONS OF THE INDEFINITE ARTICLE: 52 


A COGNITIVE LINGUISTIC APPROACH 
Adrian Lou 


PREFACE 


THE 41st LACUS FORUM was held at the Vancouver campus of the University of British Columbia from August 
6 to 9, 2014. The conference theme was Pragmatics, Cognition and Culture, a theme intended to explore 
broad connections between linguistic interaction, cognitive processing, and culture. This volume contains 
a selection of 19 peer reviewed papers from the meeting. 

The Forum opened with welcoming messages by Jessica de Villiers, the local organizer, and Janet 
Giltrow, Associate Dean in the Faculty of Arts. Invited lectures addressed the forum theme from both 
cognitive and cultural perspectives. The inaugural plenary lecture was delivered by Herbert H. Clark of 
Stanford University. His paper, “On the rational basis for communication,” argued that talk is a variety of 
rational behaviour subject to bounded rationality and presented a model of talk that accounts for basic 
conversational features as well as apparent departures from the ideal. Alexandra D’Arcy of the University of 
Victoria delivered a second plenary talk, “In a sea of Canadian English: Victoria’s linguistic legacy,” which 
presented a diachronic study of patterns of usage, variation, and change in the English spoken in Victoria, 
British Columbia. 

Consistent with the thematic focus on culture, the program reflected linguistic diversity in different parts 
of the world. Many of the papers presented focused on research from outside North America or examined 
languages other than English. Topics included, for instance, affricate distribution in Wu (Parker van Dam), 
topicalization and its decay in Spanish (de Andrade), the representation of Acadian/New Brunswick French 
in literature (Keppie), and ancient Indian grammars (Malcolm). 

The Presidents’ Prize was awarded to Kellen Parker van Dam for his paper “An Answer to Y.R. Chao's 
Problem of Affricate Distribution in Wu,” which accounts for the distribution pattern of affricates in Wu 
dialects, showing they are not randomly distributed. Stephen Straight of Binghamton University delivered 
the presidential address “Form versus Content and Input versus Output: The Dual Bifurcation and 
Interdependent Synergy of Construing and Saying” during the banquet dinner held at the McGavin Rugby 
Centre on UBC’s campus. This address explored, among other things, the intermingling of reception and 
expression in language acquisition and use. 

Of special note, this volume contains the last paper published by the late Jonathan Fine, an important 
contributor to the field of clinical linguistics and a great friend to many linguists, including the editors of 
this volume. His paper, “Modelling Transitions in the Language of Psychiatric Disorders,” emphasizes the 
characterization of local features in discourse that contribute to assessments of communication disorders. 

The editors express their thanks to the review committee and to the reviewers for their help in reviewing 
papers for this volume. We also thank Dorota Lockyer, Adrian Lou, and Jordan Boschman, graduate 
students from the Department of English who helped with the conference organization and hosting. LACUS 
also thanks the University of British Columbia, and in particular the Department of English, the 
Department of Athletics and Recreation, and John Foster, Department of Athletics and Recreation, for 
financial contributions and invaluable assistance. 


-Jessica de Villiers, William J. Sullivan, & Dan Mailman 


LACUS Forum 41:1.1-11 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free 
of charge and with no implied warranty. Its contents are available to you under the terms of the 
Creative Commons Attribution-NonCommercial license version 3.0; see 
http://creativecommons.org/licenses/by-nc/3.0/. 

Copyright © 2020 the Linguistic Association of Canada and the United States. 





AN ANSWER TO Y. R. CHAO'S PROBLEM OF 
AFFRICATE DISTRIBUTION IN WU 
KELLEN PARKER VAN DAM 
National Tsing Hua University 


Abstract: This paper aims to provide an explanation of a problem first pointed out by Yuen-Ren Chao in his 
Xiandai Wuyu de Yanjiu (1956). In addressing an apparently random distribution of fricative initials and their 
corresponding affricates, Chao described it as “a mess” with “each [place] having their own [independent] 
manner” of articulation. In the current work I show how the distribution actually follows a clear pattern of 
lexical diffusion radiating out from Hangzhou on one hand and the Jianghuai Mandarin dialect frontier on the 
other. Furthermore, I posit that the original motivation for the change from voiced fricatives to voiced 
affricates is the Aerodynamic Voicing Constraint together with the need to preserve the phonemic voicing 
distinction in Wu. 


Keywords: Wu, Chinese, lexical diffusion, phonology 


Languages: Wu, Mandarin, Middle Chinese 


WU IS A SINITIC LANGUAGE spoken primarily in the Yangtze Delta region of East China. There are 
approximately 80 million speakers world-wide, with the largest population residing in Shanghai. Wu is 
spoken in the Shanghai municipality, Southern Jiangsu province and most of Zhejiang province, as well as 
small areas bordering Jiangsu and Zhejiang. The language, descended from Middle Chinese, as are 
Cantonese and Mandarin, is known for its conservation of the early Middle Chinese voicing distinction of 
plosive onsets, lost in most other Sinitic varieties. Many Wu dialects show a clear trend of 
affrication of otherwise fricative voiced onsets. This is particularly prevalent in the Northern Wu dialects 
(also known as the Taihu dialect group) found in Northern Zhejiang province north and west of 
Hangzhou, as well as in Southern Jiangsu province and the Shanghai municipality. The trend was first 
pointed out in 1928 by Yuen-Ren Chao, who described it as occurring only in words with one of a limited set of initials 
from the traditional Qièyùn classification system. The classification system has been in use since the 7th 
century, undergoing multiple revisions and reinterpretations, and is still widely referred to in modern 
Chinese dialectology. Despite the system having fallen out of favour with the rise of modern linguistic 
theory, it provides a convenient shorthand for scholars of Chinese. The specific initials brought up by Chao 
and discussed here are the 從 cóng, 邪 xié, 澄 chéng, 崇 chóng, 船 chuán and 禪 chán initials. 

In his original description of the distribution issue, Chao describes the situation as “a mess,” with each 
dialect having its own variations (1956:39). By this he meant that there was no apparent pattern to the 
distribution as far as which words in this class have a voiced fricative onset as compared to those with voiced 
affricate onsets. He also did not have an explanation for the distribution of alveolars compared to 
alveopalatals. 




Specifically, we see many cases where words that have Middle Chinese fricative onsets of /z/ or /ʑ/ may 
be pronounced with /dz/ or /dʑ/ onsets in modern dialects, but not consistently across the region or within 
a single dialect. For example, in the Huzhou 湖州 dialect, one such word is pronounced [dzoʔ] where the reconstructed Middle Chinese 
form is *zywit, and another has a [dz] onset where the reconstruction has initial *z. However, Huzhou also has words in the same class 
with [z], [ʑ] and [dʑ] onsets. The same class of words may have onsets that are limited to just [z] in the 
dialects of Jiashan and Pinghu 平湖, while in Changhua 昌化 and Haining 海寧 they may be [z], [ʑ] or [dz]. 
Other places such as Lin’an are like Huzhou in that the onsets may be [z], [ʑ], [dz] or [dʑ]. 


Table 1: Sample of affricate onsets across dialects. Data from Xu 2007. 


initial: ## # 8 WR BH FF it 
word: # FF 3B FH WS t # 
Anji 2A dz|z|z|z|dz|z2|z|z 


Changhua BiE dz dz zh dz 24 z z 


Changxing RER oz 2 zi 2zidzs2iszidz 


Déging #5 dz|2|)}z]24|2z|dz| z|z 
Fuyang BIS dz|z|zj|z2aJ|dz|z|dz|z 
Haiyan 7 BS dz|zjizJ|dz|zji{dz} zz 


Hangzhou fy =6dz dz dz dz dz dz dz dz 


Huzhou Ayn z|z2\|z|2|dz\|dz| z | dz 
Jiashan = Zi1Zi/Z/)/zZ2/)2z2|2z|2z{2z 
Jiaxing 3 i dz|dz| z|dz|z|dzj z | z 
Lin’an Fa 2 dz dz z 2a dzdzz dz 


Tongxiang Hh dz z z z dz dz dz z 
Yuhang BRAT dz dz dz dz dz dzoz z 


Aside from the assessment that things were messy, no further explanation was given in Chao’s original 
text. The lack of clear explanation for this distribution has continued in more recent works as well. In her 
detailed 2007 study of Northern Zhejiang dialects, Xu Yue repeats Chao’s sentiment regarding the 
apparently scattered nature of the initials (Xu 2007). When the initials are addressed, the treatment is typically a 
dialect-by-dialect description, with no reasoning offered as to the cause of the irregularity. 
This is the manner in which the distribution is discussed in much of the literature. However, it is insufficient 
to explain the overall situation, since it does not say why the distribution is the way it is. 

There is also a competing trend of deaffrication of voiced affricates. This occurs not only in the initials 
presented by Chao, but also with onsets classified under other initials in the Qièyùn system. This paper 
attempts to account for the current irregularity of affricate initials in Northern Wu (Taihu) dialects 
as the result of these two competing trends. In doing so, I will also show that the 
geographical distribution can be explained through conservation of earlier Middle Chinese onsets 
in some cases, and as the result of lexical diffusion of affricates in the remaining cases. 


1. CONSERVATION OF PHONEMIC VOICING DISTINCTIONS IN WU. An important feature in modern Wu 
dialects is the preservation of a three-way phonemic distinction between voiced onsets and both aspirated and 
unaspirated voiceless onsets. While the existence of a three-way voicing distinction is no longer considered 
the key Wu classification feature, its importance in Wu dialects is still unquestionable. The origins of the 
tripartite distinction in Wu, as well as the marked difference in articulation between northern and southern 
dialects, were discussed in Norman (1988) and expanded upon in Coblin (2002). 

Coblin points out that where Jianghuai and Wu pronunciations match, this is due to the 
southward immigration of Mandarin speakers. South of the Yangtze, the Jianghuai pronunciations are 
instead a literary stratum. These are still less common in southern Zhejiang even as literary pronunciations. 

In cases where there is a double match of [both literary and colloquial] forms in Jiang-Huai and 
Northern Wu, this reflects preservation north of the Yangtze of fragments of the original, pre-migration 
speech forms, which were historically related to the contemporaneous ancestor(s) of common Northern 
Wu. What determined whether the later, northern readings became dominant or were retained merely as 
reading pronunciations was the degree to which an old, pre-migration speech type survived in a particular 
area (Coblin 2002:536). 

It has been pointed out by Simmons (1999:56) that there is perhaps a more elegant way to address this 
distinction, not as actual distinctive voicing but rather as the additional phoneme /ɦ/ in words with lower-register 
tones. This accounts for the murmur or breathy voicing qualities found in this tonal register, while 
also offering an explanation for the merger across the register as found in dialects like that of Shanghai. 
While I am in agreement with Simmons’ assessment, I will here follow convention in referring to the 
distinction as one of onset voicing. 

Though there are approximately a dozen instances of a word losing the voicing of the onset in one 
dialect or another, often without a corresponding change in tone contour from the voiced register to the 
voiceless register, such devoicing is the exception and not the rule. Within the dialects being addressed here, 
in nearly all cases the patterns of ongoing sound shifts preserve voicing. Only in a few very rare cases is 
voicing actually lost. For this reason, an explanation of the patterns should have voicing in mind. 




2. METHODOLOGY. For the current work, sixty-seven words under the initials 從 cóng, 邪 xié, 澄 chéng, 
崇 chóng, 船 chuán and 禪 chán (referred to here as the Chao initials) were compared across 28 dialects of Wu, 
all belonging to the Northern Wu (Taihu) branch of the language spoken in Southern Jiangsu, Northern 
Zhejiang and the Shanghai municipality, as well as the transitional dialect of urban Hangzhou in Zhejiang 
province. In addition, four more words, two each of the 日 rì and 群 qún initials, were also analysed 
across the same dialects. These are words separate from the Chao initials which would typically have 
an affricate onset but have lost the affrication in some dialects. 
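The comparison described above amounts to tabulating, for each dialect, how many of the compared words carry a voiced affricate onset. A minimal sketch of such a tally follows. The dialect labels, word labels and onsets are hypothetical placeholders, not values from the survey, and the helper name `affricate_rate` is introduced here purely for illustration.

```python
from collections import Counter

# Hypothetical toy data: onset of each compared word, by dialect.
# Labels and onsets are placeholders, not values taken from the survey.
onsets = {
    "Dialect A": {"w1": "dz", "w2": "dz", "w3": "z", "w4": "z"},
    "Dialect B": {"w1": "z", "w2": "z", "w3": "z", "w4": "z"},
    "Dialect C": {"w1": "dz", "w2": "dz", "w3": "dz", "w4": "dz"},
}

# Onsets counted as voiced affricates in this sketch.
AFFRICATES = {"dz", "dʑ"}


def affricate_rate(words):
    """Return the share of words whose onset is a voiced affricate."""
    counts = Counter(onset in AFFRICATES for onset in words.values())
    return counts[True] / len(words)


rates = {dialect: affricate_rate(words) for dialect, words in onsets.items()}
```

Sorting `rates` by value would give a rough ordering of dialects from least to most affricated, which could then be set against distance from a proposed centre of spread.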

Shanghai data are largely from You (2013). Northern Zhejiang dialect data come from Xu (2007). 
Suzhou data are from Wang's Standard Suzhou Phonology Handbook (2009) and Suzhou Dialect Research 
(2011). Ningbo data are from Morrison (1876) and Tang et al. (1997). Changzhou data are from Zhou et al. 
(2011) with supplemental data from Jiāngsū Shěngzhì (1998). Wuxi data are from interviews conducted for 
the current paper. 


2.1 RECONSTRUCTED MIDDLE CHINESE ONSETS. The initials given in Table 2 show Karlgren’s reconstruction 
as included in Modern Wu Research (Chao 1956). These do not reflect Proto-Wu but rather the Middle 
Chinese that Chao was taking as a parent language. Despite a general acceptance among modern scholars 
that Middle Chinese clearly did not represent a single variety of speech, and despite regular examples of 
words in Wu with clear etymology that contrasts with the Tang rhyme tables, we will take the following 
reconstruction as a useful starting point for tracing the development of Taihu initials. 


Table 2: Karlgren’s reconstruction as appears in Chao (1928). 





initial @ FR we 4B f€) A BF 





reconstruction Z 

















d | dz 








Zz |dz/pnz| g 





澄 chéng has been reconstructed as an alveolopalatal stop in division III and division IV words. Likewise 
in these divisions 崇 chóng has been reconstructed as an alveolopalatal affricate. 船 chuán and 禪 chán are 
also reconstructed as alveolopalatal, occurring only in division III words. 從 cóng and 邪 xié are not 
alveolopalatal in Karlgren’s reconstruction. 群 qún is reconstructed as /g/. It has become /dʑ/ in almost all 
modern dialects and in nearly all cases. The main exception is 共 under the 群 initial with a tone of 陽去 
yángqù, where the reading is /goŋ/. In the Ningbo dialect of the late 1800s it was still pronounced /g/ 
(Morrison 1876), having not yet palatalised, though in the modern dialect it has also become /dʑ/. 

In his analysis of Common Wu, Chao combined the /dj/ onsets with /zh/, as well as combining /dz/ 
with /z/. This combination was repeated for Common Northern Wu (Simmons 1999). 


3. COMPETING SOUND SHIFTS IN TAIHU DIALECTS. There are two competing sound changes in the Taihu 
dialects. The first is lenition of voiced affricate onsets resulting in their voiced fricative counterparts. The 
second change is affrication, which in some cases is re-affrication, of voiced fricatives resulting in their 
voiced affricate counterparts. The first is taken to be an earlier shift that is still on-going, with some dialects 
being consistently more conservative. The second trend is more recent, attested in only the last 150-200 
years and possibly not occurring much outside that range. Both shifts are ongoing and will be addressed in 
turn below. 


3.1. SHIFT 1: LOSS OF AFFRICATE ONSETS. In some Northern Wu dialects there was a lack of voiced affricate 
onsets among the Chao initials. While Late Middle Chinese reconstructions have affricate or obstruent 
onsets for some of the Chao initials (specifically those under 澄 chéng, 從 cóng and 牀 chuáng, also written 
床), these had been lost in less conservative Northern Wu dialects at some 
point before the re-affrication trend began. The change originates in Hangzhou but is also further along in 
Chongming and Changzhou, areas with significantly greater Mandarin influence due to proximity to 
Jianghuai Mandarin-speaking areas. 

Chao’s 1928 account of the Wu dialects shows Changzhou had already started undergoing the change, 
while Shanghai, Songjiang, Suzhou and Wuxi were unaffected, as they mostly remain today. This is clear 
from his reconstructed forms, labeled 古音 gǔyīn, of which many are affricates or obstruents. These are also 
the reconstructions presented by Bernhard Karlgren, which were later revised by Li Fang-kuei. 

The reconstructed palatal obstruent onsets underwent affrication and merged with the alveopalatal 
affricate onsets. In his data there are no attested palatal obstruent onsets under the 澄 chéng initial. The 
/dz/ and /dʑ/ onsets then lost their affrication and became the corresponding fricatives. At a later stage, the 
distinction between /z/ and /ʑ/ was lost. 
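The sequence of changes just described can be read as an ordered set of rewrite rules applied to an onset. The sketch below is one way to make that ordering explicit. The segment symbols are assumptions made for illustration: `ɟ` stands in for the reconstructed palatal obstruent, the affricates and fricatives are written `dz`, `dʑ`, `z` and `ʑ`, and the merger is assumed to go in the direction `ʑ` to `z`, which the text does not state. The names `STAGES` and `derive` are hypothetical.

```python
# Ordered stages of Shift 1 as read from the description above.
# Symbols and the direction of the final merger are assumptions.
STAGES = [
    ("affrication of the palatal obstruent", {"ɟ": "dʑ"}),
    ("deaffrication", {"dz": "z", "dʑ": "ʑ"}),
    ("merger of the fricatives", {"ʑ": "z"}),
]


def derive(onset, stages=STAGES):
    """Return the onset at each stage, beginning with the input form."""
    history = [onset]
    for _name, mapping in stages:
        # A stage leaves the onset unchanged if it has no rule for it.
        onset = mapping.get(onset, onset)
        history.append(onset)
    return history


palatal_path = derive("ɟ")
alveolar_path = derive("dz")
```

A dialect in which the change stopped early would correspond to truncating `STAGES`, which is one way to picture the more conservative dialects mentioned above.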



In many cases this process has led to a general uniformity of onsets for these initials. This is seen in the 
dialects of Shanghai and Suzhou and to a slightly lesser extent in Wuxi. It has been suggested by Zygis et al. 
(2012) that a conflict in the aerodynamic requirements for voicing stops and fricatives will lead to such 
devoicing of affricates, explained below, which may have been a motivating factor. 


Figure 1. Process of deaffrication in Taihu Wu dialects. 


3.1.1. THE AERODYNAMIC VOICING CONSTRAINT. The Aerodynamic Voicing Constraint (AVC) states that voicing 
requires a sufficient difference in pressure, ΔP, between the sub-glottal pressure, Ps, and the pressure in the oral 
cavity, Po (Chomsky & Halle 1968:300-301). When Po approaches Ps (for example during the sustained 
production of obstruents), voicing is lost due to a lack of the airflow required for vocal fold vibration. 
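The constraint can be written compactly as a condition on the pressure drop across the glottis. In the sketch below, the threshold symbol \Delta P_{\min} is an assumption introduced for illustration; the text gives no value for it.

```latex
% \Delta P_{\min} is an assumed symbol for the minimum pressure drop
% needed to sustain vocal fold vibration; no value is given in the text.
\[
  \Delta P = P_s - P_o, \qquad
  \text{voicing is sustained only while } \Delta P \geq \Delta P_{\min}
\]
\[
  P_o \to P_s \;\Longrightarrow\; \Delta P \to 0 < \Delta P_{\min}
  \;\Longrightarrow\; \text{voicing ceases}
\]
```

On this reading, any articulation that lets oral pressure build up, such as a sustained closure, pushes \Delta P towards the threshold.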

In Ohala (1997, 2011) the significance of the AVC is described as it pertains to fricatives and stops. 
Zygis et al. (2012) provide additional evidence that voiced affricate onsets are unstable and likely to be 
eliminated from the phonetic inventory. Based on corpus data they show that voiced affricates are less 
frequent than any other segments in the represented languages. The most convincing cause of the 
infrequency of voiced affricates given by Zygis et al. is the aerodynamic requirements of such segments. The 
requirement of higher pressure in the oral cavity for stops causes interference with the pressure 
requirements for fricatives, resulting in a greater tendency for voiced affricates to have longer voice-onset 
times or lose voicing altogether. 

Accordingly, the data presented in Chao (1956) show the results of deaffrication as having run its course 
in the dialects of Suzhou and the three Shanghai dialects given, being Urban Shanghai, Songjiang and 
Pudong. Wuxi shows a greater number of affricate onsets in Chao’s data, though this difference has since 
been lost in the modern dialect, where affricate onsets are now almost completely absent. 

For obstruent onsets in Wu dialects spoken in Northern Zhejiang and Southern Jiangsu, minimal sets 
are composed of voiceless aspirated, voiceless unaspirated and voiced counterparts. The minimal 
phonemic contrast for bilabial obstruents is /pʰa/, /pa/ and /ba/. In Southern Wu dialects, these are 
produced as [pʰa], [pa] and [ba]. However, in Taihu dialects spoken in the North, the voicing has 
typically been lost. Instead, a strategy of adaptation to the AVC has come about where the following vowels 
take on breathy voicing, resulting in the same set above being produced as [pʰa], [pa] and [pa̤] in the 
northern Taihu dialects. 

Additionally, the split may be rendered as [pʰa], [pa] and [ba] or even [pʰa], [pa] and [pɦa]. There is 
still some disagreement as to whether the “voiced” initials actually do differ in quality from their voiceless 
counterparts, or whether it is strictly a difference in vowel quality. 




3.2. SHIFT 2: AFFRICATION OF VOICED FRICATIVE ONSETS. 


3.2.1. GENERAL TREND OF AFFRICATION. Figure 2 shows an ongoing trend occurring in the vast majority of 
Northern Wu dialects, including those in Shanghai and Wuxi, while Figure 3 shows additional shifts 
present in the same dialects. It is Figure 2 that shows what we would generally expect, and indeed reflects 
the majority of changes. 


Figure 2: Most typical process of affrication. 

Figure 3: More complete description of shifts. 


There is an additional trend in border areas, particularly on Chongming Island just north of urban 
Shanghai, for the /dʑ/ affricate to revert to /ʑ/ in the younger generation of speakers. This may simply be 
the result of analogy. With the exception of the few cases where New Chongming has returned to /z/ after 
Old Chongming has already shifted to /dz/, the dialect is consistently further along in the trend as compared 
to all mainland Shanghai dialects surveyed. 

Based on the distribution it can be shown that the presence of affricate initials is the result of 
Mandarin dialect influence, either from the dialect spoken in Hangzhou or by way of contact with 
dialects along the Jianghuai Mandarin frontier. In each case where a clear centre of spread is evident, the 
focus is a place with strong Mandarin influence, either by way of geographic proximity to the frontier, as in 
Chongming and Changzhou, or by being immediately adjacent to Hangzhou’s Wu-like Mandarin dialect. 
This will be addressed in the following section in the context of historical migrations into the area. 


3.2.2. HISTORICAL MIGRATIONS OF MANDARIN SPEAKERS. The classification of the urban Hangzhou dialect 
has been controversial, with parties arguing for its classification as Wu on one side and as Mandarin on 
another. A convincing argument for its membership among Mandarin dialects was put forward by Richard 
VanNess Simmons in Chinese Dialect Classification (1999) and further supported by John Zhiqiang Yu's 
Optimization of Wu Dialect Classification (2000). In the current paper Hangzhou is taken to be a Mandarin 
dialect under heavy Wu influence, an opinion that is gaining popularity among some scholars of Chinese 
dialects but one which is still not supported by the Chinese Academy of Social Sciences. 

The cities of Hangzhou and Nanjing were traditionally Wu-speaking areas. This began to change in the 
12th and 13th centuries as wars and conquests in the north pushed more and more Mandarin speakers south. 
The first major migrations coincided with the Jurchen conquests and the fall of the old Northern Song 
capital of Kaifeng. The Song Court fled south, settling first in Nanjing and later moving to the city of Lin’an, 
now known as Hangzhou and not to be confused with the nearby modern town of the same name. A second 
wave of migration occurred as the Mongols pushed into the recently claimed Jurchen territory before 
continuing on to the Southern Song lands in the Sichuan basin and the Yangtze Delta region. 

These migrations substantially changed the linguistic landscape, with Nanjing now being almost 
exclusively Mandarin-speaking, with a few rural exceptions such as the districts of Gaochun and Lishui in 
the far south of the city. The region was further affected by later migrations of Mandarin speakers during 
the Taiping Rebellion in the mid-1800s. This is particularly evident in the dialects of Jintan, which are split 
into an Old and a New variety. The New Jintan dialect is that spoken by descendants of these later 
migrations. Neighbouring areas also saw a marked increase in Mandarin speakers coming from Northern 
Jiangsu, moving into the lands that had been emptied of former inhabitants by the long siege against Jintan 
(Simmons 1999:84-85). This is one of the major factors influencing the modern Changzhou dialect, a clearly 
Wu dialect showing much stronger Mandarin influences than are seen in Wuxi, the city just east of 
Changzhou. 

This history has put the classification of the Hangzhou dialect in question, with many believing it to be 
a Wu dialect with heavy Mandarin influences, while others suggest it is a Mandarin dialect with heavy Wu 
influence. As has been shown by Simmons (1999) and Yu (2000), the dialect of Hangzhou is best classified 
as a Mandarin dialect with Wu influences. 


4. CONCLUDING REMARKS. The distribution of affricate onsets in the Taihu region can be explained through 
two competing sound shifts. First, there is a trend of deaffrication of the Chao initials due to aerodynamic 
voicing requirements in fricatives. Overlapping this trend is one of affrication of voiced fricatives. This 
spread of shift has characteristics of lexical diffusion, though in some cases this is at least in part due to 
conservation of earlier affricate initials not affected by the first trend of deaffrication. Additionally, rather 
than occurring as one shift followed by another, both shifts are overlapping and as such we may see a change 
within a dialect toward affrication in one generation but deaffrication in a subsequent generation. 

From the data we see that affrication among the voiced Chao initials is clearly an ongoing shift in 
Northern Wu dialects. Though the shift is fairly consistent, there are exceptions. 

Words may once again lose affrication or voicing before the full shift can occur, resulting in a handful 
of rare exceptions where /z/ has devoiced to /s/, taking it out of the set susceptible to change, or /z/ has 
become /dz/ only to again become /z/ at a later time. This is most evident in the last few generations of 
speakers of the Chongming dialect. 

It is also clear that the trend has progressed further in areas either bordering the Jianghuai Mandarin 
dialect frontier along the Yangtze River or in those cities closest to Hangzhou. It is worth noting that this is not 
the only case of clear diffusion coming out of Hangzhou in the last century. In Chao’s 1928 assessment of 
Wu, among the dialects surveyed only the Hangzhou and Jingjiang dialects showed the typical Mandarin simple 
negative, 不 bù. This is signified by a bilabial onset in contrast to the more typical labiodental onsets on the 
simple negative. However, use of the Mandarin form of the simple negative has since spread to other Wu 
dialects, for example the dialect of Qianxiyao Village in Lishui 溧水, as well as the neighbouring 
Gaochun District (Phonemica 2014), both in Southwestern Jiangsu Province (Simmons 1999:79,141). Note 
that at the time of Simmons’ work with the dialect, the village was under the jurisdiction of Gaochun 高淳. 
Recent changes in administrative divisions have resulted in its current status as a village in Lishui. The 
village is located to the northwest of Héfèng Town 和鳳鎮. 




In cases where the Chao initials displayed affricate onsets in any of the Northern Zhejiang dialects, the 
affricate was also present in the Hangzhou dialect with only two exceptions, 繩 shéng and 勺 sháo. Both of 
these exceptions had affricates in only two locations: in Chongming for both, far from the influence of 
Hangzhou and instead the result of influence from the North, and then in Tongxiang alone for 繩 shéng 
and in Yuhang alone for 勺 sháo. In all other cases, any affrication on a given word in any of the dialects 
corresponds to affrication on the same word in the Hangzhou dialect. 
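The generalisation above is an implicational claim: an affricate on a word in any dialect implies an affricate on the same word in Hangzhou. A minimal sketch of how such a claim could be checked against a table of onsets is given below. The data are hypothetical placeholders rather than survey values, and the function name `exceptions_to_reference` is introduced here for illustration only.

```python
def exceptions_to_reference(data, reference="Hangzhou", affricates=("dz", "dʑ")):
    """Map each word to the dialects where it has an affricate onset
    although the reference dialect does not."""
    found = {}
    for dialect, words in data.items():
        if dialect == reference:
            continue
        for word, onset in words.items():
            reference_onset = data[reference].get(word)
            if onset in affricates and reference_onset not in affricates:
                found.setdefault(word, []).append(dialect)
    return found


# Hypothetical toy data; labels and onsets are placeholders.
toy = {
    "Hangzhou": {"w1": "dz", "w2": "dz", "w3": "z"},
    "Dialect A": {"w1": "dz", "w2": "z", "w3": "z"},
    "Dialect B": {"w1": "z", "w2": "dz", "w3": "dz"},
}

result = exceptions_to_reference(toy)
```

In the toy data only `w3` violates the implication, and only in one dialect. Applied to a full table, the keys of `result` would list the exceptional words and the values the locations where they occur.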

The trend has progressed the least in places like Suzhou or Wuxi, which are relatively isolated from the 
major centres of spread, bordered by the Yangtze to the north and Lake Tai to the south. In border dialects 
which exist on the far side of the river, for example Chongming or Qidong as Wu dialects on the north 
shore or around Nanjing and the nearby Jianghuai Mandarin dialects on the south shore, there is 
significantly more affrication. The additional geographic border of the mountain range between Anji and 
Déqing has also significantly shielded Déqing from much of the change. 

Save for the few changes that conflict with the trend, there is a predictable pattern. Certain sub-trends 
within the trend are apparent. Words under the 澄 chéng initial have a much higher incidence of affricate 
initials. Words with 從 cóng and 崇 chóng initials have a very high frequency of affricate initials but this is 
mostly limited to towns which are very close to Hangzhou or a Mandarin frontier. According to Xu, the 
one clear case where there is a degree of predictability in regard to words under this category is with the 
澄 chéng initials. Typically, these initials are pronounced [dz] except in Yuhang 餘杭, Changhua 昌化, Jiaxing 
嘉興, Pinghu 平湖, Haiyan 海鹽, Haining 海寧, Tongxiang 桐鄉, Chongfu 崇福 (referred to in Xu [2007] 
by an earlier name, Chongdé 崇德) and Déqing 德清. In other dialects this is usually not the case, and they 
are either [z] or [ʑ]. Occasionally, they are also pronounced [dz] in some rural areas around Yuhang, 
Jiaxing, Tongxiang, Haining and Déqing. These exceptions are a minority, and we can generally assume 
that dialects in the above-mentioned towns will not have the affricate initial for 澄 chéng initials (Xu 2007). 

邪 xié initials are much more likely to have a /z/ initial in areas far from Hangzhou. This is a 
conservative feature as it relates to Chao’s analysis and Karlgren’s reconstruction. However, in Hangzhou 
and nearby dialects as far as Jiaxing and Haiyan, the 邪 xié initial is still often produced as affricate [dz]. 
Old Chongming dialect shows this as well; however, New Chongming tends to be fricative [z]. 

The combined xié and chéng initials are a good example of the two different trends being analysed 
here. Xié is traditionally a fricative and is reconstructed by Karlgren in Chao (1956) as /z/. This typically 
corresponds to Mandarin words starting in /ɕ/ or to a lesser extent /s/ as in sú. In dialects with a 
conservative xié onset, we should expect to see /z/ or /ʑ/. However, we end up seeing a number of dialects, 
primarily around and including Hangzhou, which have a /dz/ onset as a result of affrication. On the other 
hand, the chéng initial, which has been reconstructed as /dz/, is almost always /z/ or more rarely /ʑ/ away 
from Hangzhou, with the one consistent exception being the Chongming dialect. 

Additionally, among the Shanghai dialects, it is primarily among younger speakers where /z/ is found 
for the xié initial, while older speakers produce the onset as /z/. It is yet to be seen whether this is the 
result of AVC affricate loss, as is proposed for many of the Chao initials, or if instead it is the result of some 
degree of levelling or hypercorrection that contradicts the general trend. 


Table 3. Re-affrication in Shanghainese dialects. 


character gloss Middle Chinese Songjiang Chuansha Urban Shanghai 


Ht wall Idz/ z— dz ZZ z— dz 


ase money /dz/ Z—>&% z— dz z— de 


An interesting issue comes up with cóng initial words. Like chéng, this is reconstructed as an 
affricate, in this case /dz/. And like chéng, we see the Hangzhou area dialects being more conservative in 
showing an affricate onset in the modern dialects, while more distant dialects show a fricative onset. We 
can thus say that deaffrication has occurred in the northern dialects, such as those around Shanghai. 
However, we also find that the Shanghai dialects of Songjiang and Chuansha show more recent 
re-affrication on many words with the cóng initial. This is an instance of the affrication process occurring after 
the deaffrication process, since in both cases a word which is under a traditionally affricate initial has a 
fricative onset in the speech of the older generation, but then an affricate onset among younger speakers, as 
shown in Table 3. This is also seen in Table 4, which shows the shift in both directions in the dialect of Ningbo, 
a Taihu dialect spoken in the port city east of Hangzhou. The Ningbo data are from Morrison (1876) and Tang et 
al. (1997). 
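
The reasoning in the preceding paragraph, which compares the reconstructed initial with the onsets of older and younger speakers, can be sketched as a small decision procedure. This is only an illustration: the function name, the two category labels, and the example inputs are assumptions introduced here and are not drawn from the survey data.

```python
# Illustrative sketch only: the category labels and function name are
# assumptions made for this example, not terminology fixed by the paper.

def classify_shift(reconstructed, older, younger):
    """Label the change between generations for one word.

    Each argument is either "affricate" or "fricative":
    reconstructed = onset class of the traditional (Middle Chinese) initial,
    older / younger = onset class heard from each generation.
    """
    valid = {"affricate", "fricative"}
    if not {reconstructed, older, younger} <= valid:
        raise ValueError("onset classes must be 'affricate' or 'fricative'")
    if older == younger:
        return "stable"
    if older == "affricate":          # affricate -> fricative
        return "deaffrication"
    # fricative -> affricate: "re-" only if the initial was once an affricate
    if reconstructed == "affricate":
        return "re-affrication"
    return "affrication"


# Hypothetical inputs mirroring the patterns described in the text.
print(classify_shift("affricate", "fricative", "affricate"))  # re-affrication
print(classify_shift("affricate", "affricate", "fricative"))  # deaffrication
print(classify_shift("fricative", "fricative", "affricate"))  # affrication
```

The distinction between "affrication" and "re-affrication" depends entirely on the reconstructed class, which is why the traditional initial has to be supplied alongside the two attested onsets.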


Table 4. Competing trends in Ningbo. 


Character Gloss Middle Chinese Old Ningbo Modern Ningbo 


FE mixed /dz/ /dz/ /2/ 
FF neat, even /dz/ /Z/ /dz/ 
R appear /z/ leh /z/ or /dz/ 


Furthermore, there is at least one instance of a word which is not among the Chao initials (it has 
a rì initial instead) but is still subject to the trend in many dialects. The word réng has developed an 
affricate onset in the dialects of Hangzhou, Yuhang, Lin’an, Tongxiang, Huzhou, Changxing, Ningbo, Anji, 
Changzhou, and Chongming. This was previously a /z/ onset, with few dialects having /n/ or /l/. The 
reconstructed onset for rì initials as used by Chao and Karlgren was /nz/. This then is a clear instance of 
the process of affrication occurring, possibly as an independent change on the words with a specific 
phonological context, but it is also possible that the change is the result of phonological levelling. This is 
significant, as native speakers are almost never aware of the traditional system of initials and rhyme classes. 
Thus while a variation that closely follows these classes, such as the difference between chéng and 
xié, may be easily linked to conservation of earlier classes, for cases where affrication is occurring on 
words with traditionally fricative initials, we may more easily accept it as the result of ongoing shifts rather 
than residual earlier forms. 




A final point worth making is the presence of alveopalatal [ʑ]. In general, across the dialects where it is 
present, a [ʑ] onset is more commonly seen in words with a front high glide as part of the rhyme class. In 
many cases the medials [i] and [y] are sufficient to predict a [ʑ] onset. Words lacking the medial have a 
much lower likelihood of showing alveopalatal onsets. In Middle Chinese, palatal affricates are largely 
limited to third-division initials. Third division finals had a palatal medial glide, while fourth division finals 
have high front vowels (Handel 2003). However, by the time re-affrication begins in Taihu, this earlier 
distinction may have been lost. This is more likely the case, at least among certain dialects, while others may 
simply be conservative regarding this feature. 


REFERENCES 


CHAO, YUEN REN. 1956 [1928]. Xiandai Wuyu de Yanjiu. Beijing: Beijing Kexué Chubanshé, 
1956. 

CHOMSKY, NOAM, & MORRIS HALLE. 1968. The sound pattern of English. New York: Harper & Row. 

COBLIN, W. SOUTH. 2002. Migration history and dialect development in the lower Yangtze watershed. 
Bulletin of the School of Oriental and African Studies, University of London, Vol 65, No 3, 529-43. 

HANDEL, ZEV. 2003. A concise introduction to Old Chinese phonology, esp. pp. 543-76. Berkeley: 
University of California Press. 

JIANGSU HISTORICAL COMMITTEE. 1998. Jiangsu provincial annals Jiangsu Shéngzhi. Vol. 89. Nanjing: 
Nanjing University Press. 

MORRISON, WILLIAM T. 1876. An Anglo-Chinese vocabulary of the Ningpo dialect. Shanghai: American 
Presbyterian Mission Press, 1876. 

NORMAN, JERRY. 1988. Chinese. Cambridge: Cambridge University Press. 

OHALA, JOHN J. 2011. Accommodation to the aerodynamic voicing constraint and its phonological 
relevance. Proceedings of the 17th International Congress of Phonetic Sciences, 64-67. Hong Kong. 
<https://www.internationalphoneticassociation.org/icphs/icphs2011> 

---. 1997. Aerodynamics of phonology. Proceedings of the 4th Seoul International Conference on 
Linguistics. Also in Linguistics in the Morning Calm 4, Selected Papers from SICOL1997, ed. by the 
Linguistic Society of Korea, 1999:105-13. 

PHONEMICA. 2014. Recording 35, Gaochun dialect. 

---. 2014. Recording 36, Gaochun dialect. 

SIMMONS, RICHARD VANNESS. 1999. Chinese dialect classification. Amsterdam, Philadelphia: John 
Benjamins. 

TANG, ZHENZHU, ZHONGMIN CHEN, & XINXIAN WU. 1997. Xiandai 
Hanyu Fangyan Dacidian. In Ningbo Fangyan Cidian, ed. by Rong Li. Nanjing: Jiangsu 
Educational Publishing House. 

WANG, PING. 2009. Biaozhun Suzhou Yin Shouce. Jinan: Qilu Shushe. 

---. 2011. Suzhou Fangyan Yanjiu. Jiangsu Fangyan Yanjiu Congshu. Beijing: Zhonghua Shuju. 

XU, YUE. 2007. Zhebei Hangjiahu Fangyan Yuyin Yanjiu [in Mandarin]. Beijing: Chinese Academy 
of Social Sciences Publishing. 

YOU, RUJIE. 2013. Shanghai Diqu Fangyan Diaocha Yanjiu [A linguistic survey of Shanghai 
dialects]. Shanghai: Fudan University Press. 

YU, JOHN ZHIQIANG. 2000. Optimization of Wu dialect classification. Shanghai: Fudan University Press. 


ZHOU, XIAODONG, YUAN ZHOU, LIZAO JIN, & GUORUI HE. 2011. 
Changzhou Fangyan Cidian. Changzhou Fangyan Congshu. Nanjing: Jiangsu Educational Publishing 
House. 
ZYGIS, MARZENA, SUSANNE FUCHS, & LAURA KOENIG. 2012. Phonetic explanations for the infrequency 
of voiced sibilant affricates across languages. Laboratory Phonology 3-2.2012: 299-336. 


This article was first published at lacus.weebly.com. 


LACUS Forum 41:1.12-30 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free 
of charge and with no implied warranty. Its contents are available to you under the terms of the 
Creative Commons Attribution-NonCommercial license version 3.0; see 
http://creativecommons.org/licenses/by-nc/3.0/. 

Copyright © 2020 the Linguistic Association of Canada and the United States. 





EFL IN SENIOR HIGH SCHOOLS: THE CASE OF TAIWAN 


YING-YING CHENG SEIJI SHIBATA 
I-Shou University, Taiwan Shumei University, Japan 


CHING-CHUNG GUEY PAUL W. MATHEWS 
I-Shou University, Taiwan Philippine Studies Association of Australasia 


KATSUMI KIYONAGA 


Iizuka Nisshinkan 
Junior High School, Japan 


Abstract: This paper briefly introduces Taiwan’s educational system in general and Taiwan’s senior high 
school (SHS) education in particular, along with its relationship to the teaching of EFL in SHS. The curricula 
in SHS are subsequently exemplified with observations of 10 English classes that reflect, in those instances, the 
reality of teaching EFL. In the concluding remarks we question the contentions of educational authorities 
about EFL teaching in regard to the four basic aims as set out by the Ministry of Education (MoE) and reflect 
on the problematic nature of current EFL teaching, and future MoE plans for 2014. 


Keywords: senior high school, EFL, English teaching, education policy, Taiwan 


Languages: English 


IN 2014 TAIWAN’S 12-YEAR COMPULSORY EDUCATION PROGRAM from elementary to SHS was implemented 
(MoE 2012) in order to improve the quality of basic education and increase the national capacity of 
competitiveness in the world (MoE 2008). Until these changes take effect, however, SHS is not fully 
compulsory, although 97% of junior high school (JHS) students go on to SHS (Minister of Foreign Affairs 
of Japan 2013), with a nominal duration of 3 years. The primary aim of general SHS is to prepare students 
for university entrance examinations, while the aims of vocational schools are more diverse. About 95% of 
general and 82% of vocational students will go to university (MoE 2013). 

Thus, this paper explores the English proficiency of SHS students and their eligibility to enter university 
and, in some cases, cope with a curriculum that is delivered in part in English. We begin by outlining the 
current education system in Taiwan. Following a review of past literature on EFL in Taiwan and a 
methodological section, we present 10 observations of classroom teaching in general and vocational schools. 
These observations, we argue, illustrate a common reality of teaching EFL and of its assessment. We conclude 
with a discussion of why English proficiency in SHS is problematic in several respects, and we argue that the 
MoE’s planned changes for 2014 may do little to address these problems. Our observations challenge the contentions 
of educational authorities about EFL teaching in SHS in regard to the premises, assertions, objectives and 
practices. 




1. BACKGROUND: TAIWAN’S COMPULSORY EDUCATIONAL SYSTEM. Senior high schools are divided into 
two main categories: 


1) General schools provide a general curriculum focusing on subjects such as Chinese, English, 
Civics, History, etc., to strengthen students’ general knowledge. Students in these schools have a clear 
ambition to enter university. 


2) Vocational schools are oriented more toward technical skills and particular occupations. They offer 
general subjects as well as subjects on practical skills such as agriculture, industry, commerce, etc. 


The ratio between general and vocational schools was about 45:55 in 2011 (MoE 2011). Almost half of 
both school types are private, whereas 98% of JHS are public (MoE 2011; Clark 2010). Private tuition fees 
are about four times those of public schools (MoE 2011). 

General schools provide 4 hours and vocational schools 2 hours of English per week in Grades 1-3, plus 
1-2 hours a week after school; students are required to take 150-160 Course Credits overall for graduation. 
A Credit is equal to a Course having 1 hour of instruction per week; thus a 4-hour-a-week EFL Course = 4 
Credits. Generally, students should attain about 50 Credits per year over 3 years, which implies they could 
have 10-12 Courses a year, of which EFL is only a small part. 
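
The Credit arithmetic above can be checked with a short calculation. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that one Credit corresponds to one weekly hour counted once per year and that after-school hours carry no Credits, neither of which the curriculum description states explicitly.

```python
# Sketch under stated assumptions; the helper names are hypothetical.

def credits_per_year(total_credits, years=3):
    """Average Credits a student must earn each year."""
    return total_credits / years


def efl_share(weekly_efl_hours, yearly_credits):
    """Fraction of a year's Credits taken up by the EFL Course.

    Assumes one Credit per weekly hour of instruction, counted once per year.
    """
    return weekly_efl_hours / yearly_credits


low = credits_per_year(150)   # 50.0
high = credits_per_year(160)  # about 53.3
print(f"Credits per year: {low:.1f} to {high:.1f}")
print(f"General EFL share: {efl_share(4, 50):.0%}")     # 8%
print(f"Vocational EFL share: {efl_share(2, 50):.0%}")  # 4%
```

Under these assumptions EFL accounts for well under a tenth of a student's yearly Credits in either school type, which is consistent with the remark that it is only a small part of the course load.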

English classes are mandatory from the fifth grade. In 2001 The Curriculum Guidelines for Senior High 
School was introduced. This laid the foundation for high school textbooks in Taiwan. In 2005 The Tentative 
Curriculum Guidelines for Senior High School was introduced. It aimed to improve and foster students’ 
learning strategies and logical thinking and expand their horizons by learning about different cultures. 
Specific objectives are (Lee 2011): 


1) To enhance listening, speaking, reading and writing skills in English so as to apply these to real-life 
communication. 


2) To develop effective learning strategies and a positive learning attitude toward English learning in order 
to foster autonomous learning competence and lay the foundation for life-long English learning. 


3) To boost students’ interest in learning English and elevate their cultural and technical literacy. 


4) To better students’ understanding of foreign affairs and other cultures and implant the concept of respect 
for humanity and global sustainability. 


The core competencies listed in Table 1 were enumerated to ensure that students “are able to”: 


Table 1. The senior school English curriculum - Core competencies (Her 2007:127-136). 


Listening
1. listen to and understand classroom language
2. understand summaries of the lesson content and texts introduced by the teacher in the medium of English and understand questions relating to lesson content
3. listen to and understand most everyday English conversation

Speaking
1. use common/main English classroom language
2. ask/answer in English simple questions that are relevant to the lesson
3. participate in English oral practice in class, communicate simple oral messages in English
4. use English accurately and fluently to read out short texts, stories, etc.
5. describe daily events in simple English

Reading
1. read frequently used English signs and charts
2. read short stories and understand the main idea
3. read materials outside of class that are at a similar level to those encountered in class textbooks with the aid of dictionaries or other tool books

Writing
1. use capital and small letters and punctuation correctly
2. combine and alter sentences correctly
3. use appropriate words or phrases to create correct sentences
4. write answers in accordance with questions asked in relation to texts
5. translate simple sentences from Chinese to English and English to Chinese

Combination of the 4 skills
1. apply words and sentence structures appropriately in class and everyday life
2. read and fill out frequently used forms and applications




Teachers in general classes offer more content and supplements for students than those of vocationally 
oriented classes. This is because students in general classes aim for further study in universities/colleges, 
while vocational students tend to find jobs upon graduation. But the latter, especially private vocational 
schools, need to offer more programs than general, public schools to meet students’ diversified needs, since 
some students at vocational schools may also choose academic-oriented programs. This makes the 
education system very complicated. 

The syllabus is generally scheduled around 4 lessons followed by a test, commonly in Weeks 7, 14 and 
20 in general schools, while in vocational schools, tests may be held after every 2 lessons. Thus, teachers in 
vocational schools do not cover as much of the so-called syllabus, which generally follows the text chosen 
by the teacher. 


2. PAST STUDIES. The current study enumerates and confirms several issues identified in a number of 
previous studies (e.g., Her 2007; Lee 2011), including our own earlier studies (Cheng et al. 2012; 2013), in 
which more than 36 classes were observed in action and their teachers interviewed. Repeated themes arise: 
classes are too large and composed of students of different competency levels; there is an emphasis on 
testing and memorizing; and many students are unmotivated or uninterested. Such themes are hardly 
surprising, given the testing and memorizing mentality vis-a-vis a liberal education. 

Such views have been expressed in several other studies of Taiwan education generally, and EFL in 
particular, and occasionally have been voiced by students themselves (Mathews 2015; Schulz 1996; Nunan 
1993:4). For example, Savignon & Wang’s (2003) study elicited the views of 174 first year university 
students in Taiwan. They found that there was a significant student preference for meaning-based classroom 
activities and a dislike for both form-focused teaching and the amount of class time devoted to the 
explanation and practice of rules of grammar. 




Du-Babcock & Du-Babcock (1987) report the predominance of grammar-translation and audio-lingual 
methods in English language teaching in Taiwan. Similar results are supported by narrative accounts 
of Taiwan English language teachers reported in Wang’s (2002) study. Huang (1998) also reports on the 
need for approaches to the learning of English that centre on the use of the language. An understanding of 
learner attitudes and their perceptions of current teaching practices is important for an accurate definition 
of the function and goals of EFL and of the methods of teaching that best help learners to attain these goals. 

Jia-Huey Her (2007) notes that the Taiwanese national curriculum guidelines for schools, strongly 
influenced by traditionalist academics, reveal a number of problems relating to a lack of proficiency 
benchmarking and a lack of coherence, consistency, and transparency in some areas. Her also reports that 
test results of students indicated a wide variation in proficiency, with individual scores differing by as much 
as 64% in the case of exit-level students. 

MoE’s past policies and curricula implemented in 2001-2003 (Liang 2002:4), and further reforms to 
commence in 2014, were supposedly to mark a major change in the philosophy of education, signaling a 
move away from rote learning and memorizing. Yet, as our study shows, there is too much emphasis on 
testing, which largely requires memorizing and rote learning. Indeed, the number of tests students have to 
pass from a very young age is not only overwhelming from a pedagogical viewpoint, but also robs students 
of much intrinsic motivation and interest. 

The comprehensive exam system (lian kao), set in place in the early 1950s by the KMT to govern access 
to the then few places in universities, has left a strong imprint (Shive 2000). Unfortunately, schools were 
evaluated simply by how many graduates got into the top universities, and so learning tended to become 
simply a matter of preparing for “the test” that required memorizing, and in which form was more important 
than content. Teaching mainly consisted of coaching students for their exams, which our observations 
clearly confirm. Thus, the curriculum became ossified—why study anything that was not on the test? 

There was no room for personal perspectives or interpretations; creativity, innovation, and artistic 
abilities were stymied. This is exactly what we see in Taiwan’s education today: an overemphasis on test 
scores leads to the neglect of learners’ overall ability. In the case of EFL learning, students tend to have a 
good command of grammatical knowledge (form) but lack pragmatic linguistic competence or intrinsic 
motivation. 

In recent years Taiwan’s business leaders have also criticized the existing system as emphasizing rote 
learning at the expense of creativity and initiative, skills required by the new knowledge-intensive 
workplace. Taiwan intellectuals returning from abroad have added force to this argument. The need to 
retain factual information only until the test diminished the inclination to think creatively 
about new material, while students with other interests or talents were left behind. By the 1990s, Taiwan’s 
economy had moved well beyond assembly to more service and R&D intensive enterprises, requiring schools 
to produce more than obedient absorbers of sanctioned information. As one of our interviewed teachers 
asked her students: do you feel you need to be taught differently than you are now? 

Through its 2014 educational reforms Taiwan is taking a potentially bold step to end the examination 
system that has shaped the tension-filled transition from SHS to university; it appears to be attempting to 
divest itself of residual Confucian elements in its examination system to provide greater opportunities to a 
wider audience and, in keeping with the information age, moving beyond the heritage of “examination 
hell.” But how far has it come, or will go, when our study and others consistently note too much emphasis 
on rote learning, memorizing, and exams? 


3. METHODOLOGY AND LIMITATIONS. As with our previous studies (Cheng et al 2012; 2013)—which 
explicate the research methodology also used in the current paper—twenty schools were selected at random 
and contacted; ten of these agreed to participate in semi-structured observations of classroom teaching and 
subsequent semi-structured taped interviews of the teachers in the Fall semester of 2013. Five general and 
five vocational schools were included. For consistency in data collection, the same author who conducted 
the observations and interviews in our previous research undertook these tasks. Immediately following the 
class, she wrote a summation of the class and interviewed the teacher for about 30 minutes. Each teacher 
was asked to respond to several questions and prompts and was provided an opportunity to raise issues of 
importance to them. These interviews largely covered the perceived strengths and weaknesses of the classes, 
general problems identified by the teacher and his/her suggested solutions, and in particular the effects that 
the MoE’s 2014 policy implementation might have for EFL education. 

As noted in our earlier studies, there are several limitations to observation as a method that may apply 
to this study (Sarantakos 1998). For instance, each class was observed only once, hence findings cannot be 
readily generalized. Nor can observation easily measure opinions or attitudes, which can only be surmised 
from observed student behaviour. Observation, too, can be subject to observer bias and (selective) memory. 
The question of validity can also be raised with respect to observation as a method. This is addressed by 
several of the authors agreeing that the observations and supporting interview comments reflect what is 
being measured, i.e., students’ ability to integrate their English communication skills, their engagement with 
and interest in English learning, and an understanding and appreciation of different cultures. 

Nevertheless, this study, like prior ones, used a triangulation of methods on the assumption that each 
class would be different and hence other methods were less appropriate. These methods were best suited to 
the nature of this study and were consistent with our two previous studies. In addition, they allowed 
teachers to present their views and suggestions. Thus, the following 10 observations of the classes draw on 
the teachers’ responses and the materials provided to the researchers, as well as the observer’s summation 
of her observations. 


4. OBSERVATIONS OF TEN SHS CLASSES IN KAOHSIUNG. The following observations of curricula and 
teaching methods were made in the Kaohsiung area during 2012-13. For this project three vocational and 
seven general SHS classes at Grade 2 were observed. Each class was 45 minutes and had on average about 
forty students. Students in each class performed at different levels of English competency. 

Our assessment of weaknesses and strengths for each class refers only to the observations on each occasion 
and draws on the various activities that the students were asked to do; these covered the four skills: reading, 
writing, listening, and speaking. For example, because vocational classes tend to emphasize listening and 
speaking, the vocational students were generally not so strong in writing and reading. This was evident 
when the teacher asked them to read a text aloud or write sentences. 

For each class the syllabus was essentially the textbook chosen by the teacher. Although some teachers 
varied this by adding their own material or changing the schedule, their mandate was to cover as much of 
the textbook as time allowed. We present several examples of their syllabi and schedules. 

There were seven female and three male teachers. Each was well experienced in teaching EFL, having 
taught for between six and 26 years. Their ages ranged from 32 to 55 years. 


4.1. OBSERVATION 1. This private SHS was established in 1957. It was a clean and well-maintained campus, 
located in the Lingya District. It was evaluated in 2012 by the MoE and found to be an excellent high school. 
It teaches kindergarten and junior and senior high students. 

One vocational and one general class were observed at this school. The same teacher, who had 15 years’ 
experience, taught each class but in a different room. The classroom for the first observation was equipped 
with a clock, fans and air conditioning; it was airy and bright. There were 49 vocational students seated in 
7 rows. 




Text and Syllabus: English 4, Longtung Publishers (n.d.). The syllabus is indicated in Table 2.

Table 2. Syllabus for Observation 1.

Week | Lesson | Title
     |        | The view from the ...
     |        | Being on time.
     | 3&4    | Being on time; A wealth of health.
     | 4&5    | A wealth of health; It’s a ‘punderful’ life.
5    | 5&6    | It’s a ‘punderful’ life; How can you buy or sell the sky?
6    |        | Test 1
     |        | How can you buy or sell the sky?
8&9  | 7      | Miyazaki’s world of fantasy.
10   | 8      | Graffiti.
11   |        | M-E-M-O: 4 tips for a better memory.
12   | 9&10   | M-E-M-O: 4 tips for a better memory; Is your diet destroying the environment?
     |        | Test 2
14   | 10     | Is your diet destroying the environment?
15   | 11     | Sing with the poets.
17   |        | Eureka! Plus Book 5 Lesson 1.
18   | Book 5 | Lesson 2 in Book 5.
     |        | Test 3


Activities: The teacher began by explaining Lesson 2. She went through the vocabulary and grammar, then 
asked the students to study the text together: next + time + future and past tenses, e.g., “We will go through 
the vocabulary next week before we study the next lesson”; “Robbie sent off his grades the next morning.” The 
teacher also taught some vocabulary from Lesson 3. 


Strengths of the class: Listening and speaking. The teacher called on students to reply to some questions, 
which they did quite well. Overall, the interaction between students and teacher was lively. 


Weaknesses of the class: Because this was a vocational class that emphasized listening and speaking, the 
students were not so strong in writing and reading. 


Issues raised by the teacher: 
“In vocational classes teachers usually do not require students to do as much as those in general 
classes, which have four hours of English and three hours after class, per week. They have twelve 
lessons to study in the semester—three tests, four lessons per test; the vocational classes have two 
hours of English and one hour after class, per week. The schedule for the general class is tighter and 
its content is more than that of the vocational classes. They have ten periods of class per day, so they 
need to stay at school until quite late, every day. Vocational classes have only 6 to 8 periods. We can 
learn at our own pace and take time to study without rushing. But there is a dilemma for us to use 
the same test for two types of classes that have a different exposure to English learning, in terms of 
the number of hours and the number of lessons each one does.” 


4.2. OBSERVATION 2. This general class was located in the same school as the class in Observation 1. It had 
the same facilities and was taught by the same teacher. The classroom was airy and bright, seating 42 students 
in 7 rows. 


Text and Syllabus: English 4 and English 5, Far East Publishers. (2012). The syllabus is indicated in Table 
3. The students had completed Lesson 1 in the previous semester. 


Table 3. Syllabus for Observation 2.

Week | Lesson | Title
     |        | The view from the ...
     |        | Being on time.
     | 3&4    | Being on time; A wealth of health.
     | 4&5    | A wealth of health; It’s a ‘punderful’ life.
5    | 5&6    | It’s a ‘punderful’ life; How can you buy or sell the sky?
6    |        | Test 1
     |        | How can you buy or sell the sky?
8&9  | 7      | Miyazaki’s world of fantasy.
10   | 8      | Graffiti.
11   |        | M-E-M-O: 4 tips for a better memory.
12   | 9&10   | M-E-M-O: 4 tips for a better memory; Is your diet destroying the environment?
     |        | Test 2
14   | 10     | Is your diet destroying the environment?
15   | 11     | Sing with the poets.
17   |        | Eureka! Plus Book 5 Lesson 1.
18   | Book 5 | Lesson 2 in Book 5.
     |        | Test 3





Activities: Lesson 4, Idioms and Phrases. The teacher focused on practicing and understanding several 
idioms and common phrases, e.g., generally speaking, loudly speaking, broadly speaking; heat up the soup/water 
heater [heat v./n.]; be free from / feel free to; make yourself/yourselves at home. 


Strengths of the class: Reading and listening. Students were responsive; their interaction with their teacher 
was good. 


Weaknesses of the class: Writing and speaking. 


Issues raised by the teacher: 
“Students have 10 classes per day so they need to stay at school until 7:00 pm. This is a very 
heavy workload. The students’ competency in English varies from one extreme to another. It’s hard 
for us to use English all the time during the class. Even though we like to help students learn more, 
we also need to keep our teaching schedule, or parents may complain. We also use the same test for 
two types of classes and competency levels. The 12-year National education plan will be carried out 
in 2014. The junior high students, who are the last, take a Joint Examination entering into SHS next 
year. The weights of the test percentages will be one-third; their grades in their three years of JHS 
will be counted when they apply for waiving the entrance exam to enter into universities. The 
problem of the 12-year plan is that different standards between schools will lead to two extremes. 
The MoE thinks it will contribute to a decrease in ‘star’ SHS. I think its weaknesses are more than 
its strengths.” 


4.3. OBSERVATION 3. This general school, established in 1922, was formerly a boys-only school, but has 
recently enrolled girls. It is one of the best schools in southern Taiwan. There are 72 classes in the school. 
A general class of 43 students seated in 11 groups was observed. It was an airy, bright classroom with a 
clock, TV, and fans. The class had 4 rather than 3 tests. 


Text and Syllabus: English Reader 4. Far East Book Company (2012). 


Activities: Lesson 4, A Wealth of Health. Listening. 

The teacher always used English in his class. First, he had students listen to 24 words with definitions, 
pronounced by a native speaker. He also showed pictures of fungi as examples and reminded students of 
the plural form with its correct pronunciation. He also talked about the different spelling in the US and UK, 
such as mold/mould. 

He asked students to write sentences on the board, e.g., “The dish was a much attraction by decoration of 
flower.” He corrected this, then went through the articles in each paragraph. 

The teacher finally talked about the writing activity on eating vegetables that they would do in the next 
class. He talked about the benefits of vegetables and gave a sample story. 


Strengths of the class: Listening and reading. Students responded actively. Their interaction in the class 
was very good. 


Weaknesses of the class: Writing and speaking. 


Issues raised by the teacher: 

“I used to be frustrated with the old-style teaching method that focused on grammar details. I enjoy
my teaching at this school. My students are great. I encourage and praise them. I choose good books 
that interest them. I also assign students to read novels. We will have a discussion at the end of each 
lesson. I told them to use English in their discussions. They have done what I told them to do. But 
they have 11 courses to study. Their core ability would develop better if some general education 
courses were cut. We could decrease the unnecessary burden for students by changing the system 
of evaluation; or evaluate if they can read newspapers in English, can write 300 words in English, 
and can take Certified Exams to substitute for the Joint Exam. How students are evaluated depends 
on how we teach. The objectives set by the MoE must be able to address real needs. I don’t have any 
idea of how to deal with the 12-year education plan next year.” 


4.4. OBSERVATION 4. This observation took place at a vocational school in the Lingya District. The 
classroom was equipped with fans and a TV; there was also a large notice about gender equality on the 
bulletin board. The 23 students in this vocational class sat in 6 rows. The teacher had 26 years of teaching
experience.


Text and Syllabus: Hospitality English and Conversation IV. Longtung Publishers (n.d.). The syllabus is
indicated in Table 4.


Table 4. Syllabus for Observation 4.
[Table not recoverable from the source; legible entries: “18-19 | 6 | Dealing with complaints” and rows marked “Test”.]





Activities: Lesson 3, Announcements. The teacher asked one student to read one announcement, and 
another student to translate it; all students took turns doing the announcements and role-playing them in 
front of the class. Finally, she led the class in reading sentences, and invited volunteers to read them. 
Students were asked to take notes and to do the homework. 


Strengths of the class: Speaking. 


Weaknesses of the class: Students were generally weak in English, which vocational students consider less
important than their practical courses.


Issues and suggested solutions raised by the teacher: 
“It is common that English is considered not as important as other subjects in the vocational
program. Only a few students’ English is good. It’s hard for me to teach. I can only ask them to do
what they can. I praise them when they do what I ask. There are three disabled students in my
class who have difficulty in learning. I need to spend more time and energy to take care of them. I
often give them encouragement, as I think it’s important to them.” 


4.5. OBSERVATION 5. This general SHS in the Cianjhen District was established in 1969. This old classroom 
was bright and airy and had fans and a projector. There were 36 students in this general class. 


Text and Syllabus: English Reader 4. Longtung Publishers (n.d.).


Activities: Lesson 7, Painting with words: Images in Poetry [a poem by Robert Frost, Stopping by Woods
on a Snowy Evening].

The teacher greeted the students and had a free talk with them to make sure that they were ready to 
have today’s class. She then cut a poem into small pieces and passed them out to each group; it was like a 
jigsaw. Students were required to put the stanzas in the right order. Students worked in their respective 
groups and shared what they read with members in order to structure the 4 stanzas appropriately. She asked 
each group to share the reasons why they pieced the poem in their way. She then had them listen to a critique 
of the poem and share their feelings, to help them understand the poem better. 

They worked in pairs to find imagery and metaphors in the poem and share what they stood for. 

The whole class read the poem aloud. She then introduced the form of the poem, its 16 lines in 4 stanzas, 
and its rhyme scheme. Subsequently, the students gave feedback about the day's lesson and whatever they 
wanted to share. She then assigned homework. 




Strengths of the class: Reading. 


Weaknesses of the class: Generally, they were weak in listening, speaking and writing. Only a few were 
particularly good at speaking. 


Issues raised by the teacher: 

“It’s my first time to lead a science-major class. They are smart, but they don’t study hard, they just 
want a pass in English. They were in the top 5 last semester, but they became the 6th from the bottom
in the Section Test in March, school-wide. If they studied hard, their performance could be better.
I have had a successful experience of using all English in one high school. All of the class passed the
General English Proficiency Test (GEPT). I group the class based on the Section Test. There are 4
students per group: one higher-grade student, two with passes, and one with a lower grade. I also
choose a TA from less-performing students who help my handling of the class. I’ll review the
question for the whole class. I don't require my students to preview lessons, but they need to review 
the lesson after the class. To deal with the MoE’s new policy, we set up a new English Elite Class this 
year. It helps us recruit good students to study at our school, who may help strengthen our 
competitiveness to face the new situation.” 


4.6. OBSERVATION 6. This general SHS is affiliated with and shares the campus with a national university 
in the Lingya District. It was established in 1978. It has all school levels on one campus, which helps attract
students to study there. There are 24 classes at the SHS level. Each grade has one elite class in Maths and 
Science, and now in English, which is in response to the forthcoming 12-year MoE policy. 

The general class observed was a Maths and Science elite class, in a room with fans, a projector, and air 
conditioning. The 24 students sat in 6 rows. 


Text and Syllabus: English Reader 4. Longtung Publishers (n.d.).


Activities: Lesson 8, Freud and the meaning of dreams.

After a warm-up, students actively responded to the teacher’s questions: “Who is Dr. Freud? How to
interpret dreams? What do dreams signify to you? What does flying in a dream mean?” She also gave related
vocabulary (psychological, psychologist, psychiatrist) and used ‘dream’ in sentences, e.g., I had/dreamed a
dream last night, daydream, dream walk, dream talk.

She had students close their books and listen as she twice read aloud paragraph 1 of the text; the speed 
was faster the 2nd time. She then asked them questions about it.

She also explained the function of punctuation such as the semi-colon that can be used as a conjunction, 
e.g., He has three sons. Two of his sons are married. => He has three sons; two of them are married. 

She gave them a website to help them do their homework. They will tell the class in their next session, 
‘What do they think the four pictures [on p. 173] may represent in their waking life?’ 


Strengths of the class: It’s an elite class in Maths and Science whose performance in those areas is better 
than their ability in English. However, their average English grade is the best among Grade 2 students. 


Weaknesses of the class: Speaking. 
Issues raised by the teacher: 


“We recruit into the elite class those students who have a good command of English; but it makes
both teaching and learning English harder than before in the non-elite classes because they will lack
their good peers’ leadership and support. We usually have some good students in the ordinary classes 
who can help and stimulate their classmates. But now, with the elite class, classes have been 
polarized, and many teachers think it will be even harder under the new situation in 2014.” 


4.7. OBSERVATION 7. This vocational school in the Chienchen District offers both vocational and general 
classes to meet the students’ diverse needs. There are 14 vocational and 4 general classes per grade. Students 
do 4 hours of English per week plus 1-2 hours after classes. The school is considered the best in the 
Kaohsiung area. 

To meet the 2014 MoE policy, the school cooperates with a Chinese steel corporation. Each class of 
grade 1 students in the programs of Electrical and Mechanical Engineering will have a chance for a better 
future because they will do their practicum at the corporation. If they perform well, they may be promoted 
to full-time employees there. Even if they are not hired, they will still have better experience and connections than
others in the relevant fields.

Two classes were observed at this school; the first was a vocational class. The teacher used a PC to show 
the teaching material directly on an interactive board. There were 39 students in 6-8 groups, depending on 
the activities. 


Text and Syllabus: English IV. Tongda Publishers (2012). 


Activities: Reviewing: Lesson 3, A song to help the world; Lesson 4, The world on a plate; and Lesson 5, A 
date with another woman. 

The teacher helped students to review the lessons before the test next week. To encourage them to 
actively answer questions he told them that they would get extra points. He gave different types of questions 
such as vocabulary, fill-in, sentence patterns, and pair work. 

An example of this teacher's approach is Lesson 3: 

[slide 1] tragic. Think of its other two derivatives. Student 38 replied: tragically, tragedy. [slide 2] Make a
sentence with this word. Student 4 answered: The 1921 earthquake was a tragedy.

[slide 4] The possibility that something bad will happen. Student 5 replied: threat. [slide 5] The aim of
giving money, food, help, etc. to those in need.

[slide 6] The act of saving someone from danger or harm.

[slide 8] It takes/took/will take....

[slide 9] If + S + had (not) Vpp..., S + would/could/should/might (not) + have Vpp... Student 6
responded: If it hadn't rained yesterday, this game would never have been held.

The teacher helped students to make the sentences correctly. [slide 13] Expressing Needs. Pair-work
in the classroom.

A: I don’t know where my [student 33: pencil] is. Could you lend me yours? 

B: Sure, here you are. Do you need anything else? I have [student 32: an eraser] too. 

A: [student 33: Thanks], I [student 33: need/don't need] that right now. 

Lesson 4: 

[slide 18] Contribute vs. Distribute. What’s the difference between them? 

Make a sentence using ‘contribution’. Student 22: I did great contribution on this project. The teacher 
helped: ‘I made a great contribution to this project.’ 





Strengths of the class: Reading and speaking. 




Weaknesses of the class: Writing and listening. 

Issues raised by the teacher: 
“How to foster students’ motivation is always important. I ask them to do discussions, anyone who 
replies first would get extra points for his/her efforts; I hope to increase their willingness to 
participate in the classroom activities. Although we have the right to choose texts, the
options may be changed [by the MoE] each year. The educational system is very 
complicated; if we teach different programs, the preparations are not that easy because we 
must have different types of classes to meet the students’ needs, especially if we want to 
recruit as many students as possible. It’s hard to say which teaching method would result 
in success. I apply proper strategies to different classes. It sometimes works out well when 
all factors are right. But it can’t be copied to another class, as each class is different.” 


4.8. OBSERVATION 8. This observation was of a general class at the same vocational school. The class had 37 
students. The room was equipped with a clock, fans, air conditioning, a projector and a PC. 


Text and Syllabus: English IV. Tongda Publishers (2012).


Activities: Lesson 7, Miyazaki's world of fantasy. 

To motivate the students, the teacher used her PC to show them the work of Miyazaki, a famous 
animation master. The students watched with interest. She also provided websites for students’ reference. 
She then led them in reading an article and explained the contents to them. At the end of the class she 
reviewed the sentence pattern, and asked students to practice the sentence pattern of emphasis. 


Strengths of the class: Speaking. 
Weaknesses of the class: Writing. Reading and listening were average. 


Issues raised by the teacher: 

“It’s important to have a curriculum design. I will find something interesting for students to 
motivate them. I do 30 minutes warm-up, because if they are motivated, it will be easier for both 
teaching and learning. I won't necessarily start from the first paragraph, but I will cut in from an 
interesting one. We usually have tests during the class; what they do is ok, but they don't do well in 
their regular tests. I feel I can’t do more about it. I also help students prepare for the tests they will 
take in the future, such as practice sentence patterns and translation. Students in vocational classes 
are more motivated, so teachers are more willing to teach them; but teachers will take turns to teach 
general classes.” 


4.9. OBSERVATION 9. This general school of 7,500 students in the Sanmin District was established in 1997. It
was evaluated as one of the best high schools in 2010. It has both vocational and general classes. The latter 
have 2 more hours in English per week than vocational classes. The school schedules 30 minutes of listening 
in the early morning for three days. 

This school’s participation in our project was enthusiastic. The director of teaching affairs was very 
active in helping. He explained that, because they have all kinds of programs and classes, it can be quite 
complicated. For example, some students enrolled in the vocational stream may also go to academically
oriented classes. It also generates a lot of complex work in preparing documents for the annual MoE
evaluation.

The class observed at this school was a general class of 49 vocational students. The classroom had fans,
air-conditioning, a TV and a clock.


Text and Syllabus: English Reader IV. Longtung Publishers (n.d.).
After every 2 lessons the students have a test; thus they had only 6 lessons in each semester.


Activities: Lesson 4, Are you worth your salt? 

The teacher helped students to prepare for next week’s test by reviewing the material. 

The test design is based on the university entrance exam. He used different teaching methods such as 
Question & Answer and lecturing to get students to understand the contents. 

1. Vocabulary/Phrase/Grammar: 

Murder [cold-blooded murder, unintended killing, accused of, charge with]; flex [v] be flexible in, bend
[bent, bent]; trunk [suitcase/trunk of a tree]; ancestor, descendant; empathy, sympathy.

Sentence Pattern: but, and, or [so + adj. + that, consist of, be caught/seen/spotted/heard + V-ing].

2. Conversation: 

Doctor: How can I help you today?

Mary: 

Doctor: That’s because of the cold weather. 

Mary: [a] How do I take care of my skin in winter? [b] I have a stomach ache. [c] My face feels dry and 
itchy. [d] I need a moisturizer. [e] My arms hurt. 

3. Reading Comprehension. Multiple-choice questions were given. 


Strengths of the class: Listening and reading. 


Weaknesses of the class: Speaking and writing. 


Issues raised by the teacher: 
“I’d collect the tests that were used before for vocational high school students, to let them practice,
to help them prepare for the final test. I'd tell them the new trend of the test: non-multiple-choice
questions, re-phrasing sentences, and reforming sentences will be included in the 2015 Test. I think 
teachers need to continuously learn, to self-adjust to meet students’ diverse needs. The more students 
with good grades, the higher the accomplishment. Some students do what they are told, while others 
don’t and I see no positive effects; that frustrates me. I consult with my colleagues to find ways for 
improvement. For example, we have a few exchange students from foreign countries; they help 
motivate our students in speaking English. Students are lacking in motivation, so they can’t avoid 
failing. There were 40 students in a class, more than 30 students didn’t pass the GEPT; only 3 
students passed it at the beginner’s level. Now our students’ levels are comparatively better than 
before. It’s both students and teachers who have been making efforts together; the vocational 
students do well in their performance. I challenge and inspire students to learn and help to 
strengthen their ability and their vocational skills; thus, I help build their self-confidence. It also 
encourages me to go further.” 


4.10. OBSERVATION 10. The last observation was of an all-girls general SHS in the Chienchin District. It is
considered the best high school in the city for girls. This general class had 48 students. 




Text and Syllabus: English 4. Sanmin Publishers (2012). 
Activities: Lesson 9, A glimpse into the future. 

The teacher first gave a listening test, then asked each group of students to write practice sentences on the 
blackboard, e.g., Class 217 is smarter than all the other classes in the world. Christine is prettier than other 
people. I neither like to drink alcohol, nor smoke. Neither chickens nor penguins fly. 

Following this, the class watched the video, “Did you know?” The teacher handed out a worksheet and
told them the vocabulary beforehand. She asked them questions after the video; they replied actively. Then
each group had a discussion. Finally, she asked them to prepare for an oral report on the results of each
group’s discussion based on the guidelines in the worksheet:

What are your initial reactions to what you saw in the presentation? 

What skills do you need to survive and thrive in this new era? 

We are now living in exponential times in which technology advances at such a rapid speed that much 
of the information students learn at school may become outdated by the time you graduate. In that case, do 
you feel you need to be taught differently than you are now? If yes, how? 


Strengths of the class: Reading and listening. 
Weaknesses of the class: Writing and speaking. 


Issues raised by the teacher: 

“I have tried to speak more English, to let my students listen to more English during the class, and 
I try to do as many activities as possible; I think it’s more efficient to let them learn by doing 
activities. Based on their feedback, I can find what interests them more. Students’ levels in English 
are generally high or low. The change in the way of entering into universities makes it hard for us to 
control the schedule and the depth of the contents. Their basic abilities are not as good as those who 
have graduated. They don't know how to pronounce and spell. The mistakes in their writing are so 
serious. Besides, the periods of time for English have decreased. We usually have only 4 hours per 
week, and 1 hour after the class. Not all students have 1 hour of listening.” 


5. DISCUSSION. Taiwan’s curriculum is a reflection of its philosophy, design, and objectives. It takes a 
subject-centered approach to pursue certain achievements, the consequence of which determines whether 
or not learners are smart. The objectives in learning place an emphasis on so-called academic preparation 
through a core curriculum and exams to prepare students for tertiary studies. 

In general, this subject-centered perspective is historically associated with essentialist thinking, 
according to which students should be instilled with “essential academic knowledge” that will make them 
both smart and virtuous. Learning is a matter of immersion in a set of discipline-centered subjects, 
rationalized by the embrace of mental discipline in order for students to master certain skills and facts
vis-à-vis a pragmatic pursuit of knowledge (Hlebowitsh 2004). Given this philosophy of education, teaching
largely employs a traditional, direct-instructional approach, as opposed to a learner-centered approach.

While the objectives of EFL education are to cultivate students’ interests and abilities in English and 
enable them to learn actively and effectively, these objectives are anything but practical because the general
purpose of education in Taiwan is overwhelmingly overshadowed by academic
pursuit through exams, as evidenced in our study, in which some students have at least 3 EFL exams per
semester.

More often than not the classroom has a blackboard at the front and students sitting in rows
facing the instructor, like chickens in a roost, rather than as individual learners; this scenario is largely
confirmed by our 10 observations and our previous studies of JHS and Elementary classes. Such a setting is
of course flawed, because if we cannot create settings in which students are the center and multiple
interactions are maximized, how can we expect to arouse interest and motivation among learners?

Teachers typically start their lessons with lectures or demonstrations, which are then followed by some 
practice and review. The curriculum is essentially the textbook, and exams are linked to the text. Teachers 
are responsible for the coverage of the text’s content. In this system of education, it is assumed that all 
students will acquire at least a rudimentary knowledge of English. However, as previous studies show and 
this study confirms at the SHS level, this is not necessarily the case. Thus, several components involved in 
instructional settings can be identified, each of which presents problems in practice and which the foregoing 
observations generally illustrate. 


5.1. ASSESSMENT AND OBJECTIVES. Generally, the MoE’s learning objectives are difficult to achieve in SHS 
because EFL learning is subject-based and academically oriented. Exam-focused learning
leaves most SHS learners prone to poor listening and speaking skills and compromises students’
interests, motivation, autonomy and awareness of cultural differences, as indicated by observations of and
comments on EFL classes. In this regard, specific EFL objectives in listening, such as “Students can
discriminate a speaker's feeling and attitude from a sentence expressed in different tones,” are difficult to
implement unless other components such as materials, methodologies and the assessment regime are 
coordinated and integrated. 

While the objectives of EFL for SHS differ little from those of JHS, SHS students are more advanced in 
cognitive ability (Piaget 1950), and thus English syntax and logical connectors for writing, reading and 
comprehension should be more effectively taught. But SHS students have a greater study load and higher
achievement goals, so there is more pressure in high school settings. In our observations it was common for SHS
learners to be constantly exposed to the pressure of exams; learning English was thus instrumental, rather than
having any intrinsic value or interest.

Further, the objectives are not always given equal weight in practice. This is because exams play such an 
important role in the educational system, which emphasizes high scores in tests that lead to tertiary education 
in good institutions and subsequently high paying jobs and social status. This leads to the underdevelopment 
of some of the objectives, giving rise to an emphasis on grammatical rules (in which many Taiwanese
students excel) and on reading and writing vis-à-vis listening and speaking. As a result, learners are found
lacking in the pragmatic use of the language, which is emphasized in a learner-centered approach. The
limited emphasis on listening and speaking, despite the claimed emphasis on them in vocational schools,
presents English as a subject like physics or biology, in which content can be memorized; this is contrary to
the nature of language acquisition, which requires that the language be performed.

Thus, our 10 observations suggest that the assessment protocol in SHS is not, and cannot be, practically 
applied in Taiwan. As one of the teachers noted, both general and vocational students sit the same exam,
but their texts and the emphasis placed on each of the 4 skills differ. From our observations, EFL teachers were
frustrated as they tried to integrate the 4 skills in the classroom, because little progress seemed to be made, 
as several of the interviews revealed. 


5.2. CLASS SIZES AND COMPETENCY LEVELS. SHS EFL teachers are faced with the challenges of large 
classes. It is a problem that is aggravated by students of different competency levels in the same classroom. 
To reduce this difficulty, some teachers divided students into groups on the basis of competency. While our 
observations show that students at different levels can be arranged in the same group to optimize the 
interactive effects of diversity, problems also ensue because more advanced students tend to become 
frustrated in learning what they already know, and they are not trained to help less competent classmates. 




Nevertheless, teachers and learners could benefit from group approaches such as cooperative learning, 
as demonstrated by the Jigsaw approach. When learners are allowed to form their own groups and learn
cooperatively, they experience non-competitive situations in which every student has the opportunity
to succeed on the basis of his/her individual needs and differences. Practical examples of such group
processing can be found in studies by Chang et al. (2013), Celik et al. (2013), and Kyndt et al. (2013). 

However, such an approach runs counter to the existing system of competitiveness that is driven by a 
testing mentality, which gives rise to a common tragic scenario in EFL classrooms: A helpless teacher 
standing on the stage tries his/her best to give lectures and keeps doing so, even when most students’ minds 
are wandering everywhere, as was evidenced in several of our observations. 


5.3. STUDENTS AND ROTE LEARNING. Another issue is EFL learners themselves. Most students are
anxious to rely on rote memorization of English vocabulary and behavioural patterns to cope with test items, and
eventually they lose interest and motivation in learning English as a language. As Yashima et al. (2004)
indicate, one of the most critical issues in students’ learning of English is their attitude toward learning it. Since
there is no likelihood that the exam-oriented instruction will change, it is essential for EFL teachers to foster 
integrative rather than instrumental motivation among students. 


5.4. MATERIALS AND METHODOLOGY. Regarding teaching materials, the themes in EFL texts encompass 
both local and global issues, which corresponds to one of the MoE’s objectives (“To better students’
understanding of foreign affairs and other cultures and implant the concept of respect for humanity and
global sustainability”). For example, topics included “Being on time,” “Miyazaki’s world of fantasy,” and “Is
your diet destroying the environment?” There seems to be no problem in the selection of themes.

However, what is equally important is the organization of the lesson. Ideally, the arrangement of 
teaching materials in a given lesson should be based on contemporary instructional theories, and the format 
arranged to make learners feel comfortable and interested. Lessons could begin by clearly stating the 
objectives in the form of points, which will facilitate goal-setting. While this has been done well in most EFL 
textbooks, in practice the goals often are not met, or are glossed over or approached with a rote-learning,
exam-oriented mentality.

Further, the content of most EFL textbooks rarely includes sections specifically for group activities or
discussions. This is partly because EFL in SHS is examination-based, not communication-based, the
consequence of which is that little time is left for non-examination group activities through which much
of what is covered in the textbook can be consolidated.

Linked to the foregoing are teaching methods. According to our observations, most instructors began 
with ice-breaking activities to arouse learners’ interest, but there was little evidence of scaffolding (Vygotsky 
1978) that followed. From our observations, what most troubles EFL learners in SHS is not necessarily study
overload, but the lack of true understanding of the learning input. Hence seemingly complicated content
needs to be simplified and structured so that it can be more readily grasped and acquired.

However, EFL teachers have little time or capacity to build on what students already know or to 
encourage them to create connections between the topic and personal experience. Here, the principles of 
discovery-learning could be useful. It is an inquiry-based approach in which students are led to induce
meaning or general rules, rather than directly being told them; this should help students become explorers
rather than receivers. As Newmann (1996) suggests, when students participate in lessons that require them
to construct and organize knowledge, consider alternatives, engage in detailed research, inquiry, writing,
and analysis, a positive impact on learning and effective communication can be expected. Related insights
and examples can be found in recent studies by Lantolf (2006) and Swain et al. (2002).




5.5. A WAY FORWARD. The foregoing observations and summations lead to suggestions on how EFL
education in Taiwan could be improved, but further research is also required to discern how these suggestions could be
implemented in the existing exam-based system.

As noted, EFL objectives are not always practical and plausible, thus education authorities must take 
into account the learners’ situation (such as pending exams) and set goals in a way that can meet both EFL 
objectives and satisfy learners’ needs, especially when the new twelve-year educational program is imminent. 

To make assessment and objectives correspond, exams must include all four skills, with
listening and speaking given equal weight. Here, GEPT standards might assist, since the GEPT is
designed with a focus on listening and speaking.

If EFL learners believe that English is an important tool for them to communicate with people of other 
countries or as a means to acquire knowledge through English texts, then they will be more likely to study 
it as a language itself and will equally focus on the four language skills. But it is a challenge for teachers to 
convince their learners that only by learning English with equal emphasis on the four skills can they more 
effectively cope with test items. 

Further, to optimize class interactions EFL teachers must arrange classrooms to facilitate full 
interaction, but this can be difficult with a class of 40-plus. This requires teachers to help weaker students
interact with higher-level counterparts, thus enabling the former to benefit from the latter. This is what
Jigsaw techniques (Aronson & Patnoe 1997) are all about. EFL teachers can design cooperative learning 
techniques that reduce conflict or competition among students, promote better learning, improve 
motivation, and increase enjoyment of the learning experience. 

In terms of texts, Advance Organizers (Ausubel 1963) could be presented at the beginning of the
material to help activate learners’ previous experiences. The Advance Organizers may state
the relationship between what learners have already learned and what is to be learned, and may
take different formats for learners at different levels.

Similarly, teaching materials may include short stories of great figures or events that will stimulate 
affective responses, thus facilitating further devotion to learning the material. This is especially important 
when most SHS learners are at the developmental stage of individualization, which is prompted by 
inspiration, and in which short stories and personal reflections play a significant role. 

EFL teachers also must organize and present the learning input from easy to difficult, simple to 
complex, in terms of successive approximations of the learning units. However, such a program of 
instruction, like discovery learning, can be time-consuming, and thus may not be practical in the current 
educational structure that provides a curriculum of only 4-6 hours a week. Therefore, more thought is required about 
how many hours of EFL study students should undertake, and this in turn requires consideration of 
the workload of students and the exam mentality evident in our observations. 

These several factors are of high importance, for if teachers do not help learners through diverse methods, 
then the problems of incompetence will persist, and both teachers and students will continue to suffer the 
frustrations of failure. 

However, new approaches to learning often run counter to the existing system of competitiveness and 
examinations, and thus further research could be directed at how pedagogical principles and strategies 
could be integrated into the existing system, or how that system needs to be changed. 


6. CONCLUSION. In this paper a general introduction to Taiwan’s EFL education policy and structure in SHS 
was given, followed by a description of 10 observations and reflections by the teachers. In the end, we 
provided some reflections on SHS EFL education in Taiwan. In general, the status quo of English education 
at the SHS level features an exam-oriented mentality, and this departs greatly from the MoE’s objectives. 
Thus, there is much to be improved in terms of English education policy, objectives, teacher education, 
student commitment, teaching materials, methodologies, and assessment. 

First, the objectives for EFL set by the MoE must be adjusted in consideration of the expectations of 
instructors and learners. Most importantly, the EFL assessment must be conducted in accordance with the 
objectives. If exam-based instruction cannot be drastically changed, then efforts should be made to work 
out a scheme in which a well-balanced development of the four skills can be achieved, while simultaneously 
facilitating taking exams. 

Second, as SHS learners reach a more advanced level in their cognitive development, EFL teachers must 
optimize such a capacity by carrying out discovery learning approaches to counteract rote learning; such 
instructional approaches can be of great effect in coordination with objectives, assessments, materials and 
methodologies. 

In summary, EFL education in Taiwan’s SHS faces great challenges, since heterogeneity in English 
competence, large class sizes, teacher-centered instruction, and an exam-ridden curriculum all pose 
problems to EFL learning. How to bridge the gap between sound objectives and inevitable reality is 
challenging, and thus there is still a long way to go to satisfy all parties involved. 


REFERENCES 


ARONSON, E., & S. PATNOE. 1997. The jigsaw classroom: Building cooperation in the classroom (2nd ed). 
New York: Addison Wesley Longman. 

AUSUBEL, D. 1963. The psychology of meaningful verbal learning. New York: Grune & Stratton. 

CELIK, S., K. AYTIN, & E. BAYRAM. 2013. Implementing cooperative learning in the language 
classroom: Opinions of Turkish teachers of English. Procedia - Social and behavioral sciences 70: 1852-59. 

CHANG, M. M., M. C. LIN, & M. J. TSAI. 2013. A study of enhanced structured web-based discussion in a 
foreign language learning class. Computers & education 61: 232-41. 

CHENG, Y-Y., C-C. GUEY, P. MATHEWS, K. KIYONAGA, & S. SHIBATA. 2012. EFL in elementary 
schools: The case of Taiwan. Shumei University Journal #9, 44-81. 

CHENG, Y-Y., C-C. GUEY, P. MATHEWS, K. KIYONAGA, & S. SHIBATA. 2013. EFL in junior high 
schools: The case of Taiwan. Shumei University Journal #10, 19-65. 

CLARK, N., ed. 2010. Education in Taiwan. World education news & reviews 23(4). 
http://www.wes.org/ewenr/10may/feature.htm. Accessed March 1, 2014. 

DU-BABCOCK, B., & R. DU-BABCOCK. 1987. An analysis of English language education and its impact 
on business practice in Taiwan. (ERIC Document Reproduction Service No. 274-672). 

HER, JIA-HUEY. 2007. The globalization of English: Its impact on English language education in the 
tertiary education sector in Taiwan. PhD diss., The University of Waikato. 

HLEBOWITSH, P. 2004. Design the school curriculum. Boston, MA: Pearson. 

HUANG, S. C. 1998. Senior high school students’ EFL learning beliefs: A site study. In Proceedings of 
the 7th international symposium on English teaching, 477-85. Taipei, Taiwan: Crane. 

KYNDT, E., E. RAES, B. LISMONT, F. TIMMERS, E. CASCALLAR, & F. DOCHY. 2013. A meta-analysis of 
the effects of face-to-face cooperative learning. Do recent studies falsify or verify earlier findings? 
Educational research review 10: 133-49. 


LANTOLF, J. P. 2006. Socio-cultural theory and L2: State of the art. Studies in second language acquisition 
28: 67-109. 

LEE, JIA-YING. 2011. Moving towards a more balanced English teaching: A case from Taiwanese EFL 
classrooms. English language teaching 4(2): 132-37. 

LIANG, TSAILING. 2002. Implementing cooperative learning in EFL teaching: Process and effects. PhD 
diss., National Taiwan Normal University. 

MATHEWS, P. W. 2015. EFL education in Taiwan: A snapshot by students. In Policy and practice: 
Education and EFL in Taiwan, ed. by Y-Y Cheng, C-C. Guey, Paul W. Mathews, K. Kiyonaga, & S. 
Shibata. Canberra: Warrior Publishers. 

MINISTER OF FOREIGN AFFAIRS OF JAPAN. 2013. Information about foreign countries and areas. 
http://www.mofa.go.jp/mofaj/toko/world_school/01asia/infoC18100.html. Accessed June 23, 2013. 

MOE. 2008. The plan to promote 12-year compulsory education. Accessed November 27, 2013. 
http://english.moe.gov.tw/ct.asp?xItem=7084&ctNode=784&mp=1 

---, 2011. “Summary of education at all Levels in SY 2011.” Accessed January 14, 2014. 
http://english.moe.gov.tw/public/Attachment/New271115114171.doc 

---, 2012. “Education in Taiwan.” Accessed August 24, 2013. 
http://english.moe.gov.tw/public/Attachment/291017283171.doc 

---, 2013. “Education statistical indicators.” Accessed August 24, 2013. 
http://english.education.edu.tw/ct.asp?xItem=14504&CtNode=11430&mp=1 

NEWMANN, F. 1996. Authentic achievement: Restructuring schools for intellectual quality. San Francisco: 
Jossey-Bass. 

NUNAN, D. 1993. From learning-centeredness to learner-centeredness. Applied language learning 4: 1- 
18. 

PIAGET, J. 1950. The psychology of intelligence. London: Routledge and Kegan Paul. 

SARANTAKOS, S. 1998. Social research. 2nd ed. Melbourne: Macmillan Education. 

SAVIGNON, S. J., & C. WANG. 2003. Communicative language teaching in EFL contexts: Learner 
attitudes and perceptions. IRAL 41: 223-49. 

SCHULZ, R. A. 1996. Focus on form in the foreign language classroom: Students’ and teachers’ views on 
error correction and the role of grammar. Foreign language annals 29: 333-64. 

SHIVE, G. 2000. Confucian legacy and examination reform in Taiwan. Accessed March 3 2014. 
http://newviews.freehosting.net/testing.html 

SWAIN, M., L. BROOKS, & A. TOCALLI-BELLER. 2002. Peer-peer dialogue as a means of second language 
learning. Annual review of applied Linguistics 22: 171-85. 

VYGOTSKY, L. 1978. Mind in society. Cambridge, MA: Harvard University Press. 

WANG, C. 2002. Innovative teaching in EFL contexts: The case of Taiwan. In Communicative language 
teaching in translation: Contexts and concerns in teacher education, ed. by S. J. Savignon, 131-53. New 
Haven: Yale University Press. 

YASHIMA, T., L. ZENUK-NISHIDE, & K. SHIMIZU. 2004. The influence of attitudes and affect on the 
willingness to communicate and second language communication. Language learning 54(1): 119-52. 


This article was first published at lacus.weebly.com. 


LACUS Forum 41:1.31-42 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free 
of charge and with no implied warranty. Its contents are available to you under the terms of the 
Creative Commons Attribution-NonCommercial license version 3.0; see 
http://creativecommons.org/licenses/by-nc/3.0/. 

Copyright © 2020 the Linguistic Association of Canada and the United States. 





THE DECAY OF TOPICALIZATION FROM OLD TO MODERN SPANISH: 
ANALYZING HYPOTHESES WITH A CORPUS BASED STUDY 
AROLDO DE ANDRADE 
Universidade Federal de Minas Gerais 


Abstract: This article aims at providing a preliminary study of different hypotheses dealing with the decay of 
topicalization from Old to Modern Spanish. With the help of a database composed of letters from the 15th 
and 16th centuries, some considerations about their theoretical and empirical adequacy are made. Finally, a 
new hypothesis is proposed, emphasizing the loss of verb movement and opening some further questions 
for research. 


Keywords: Syntactic change, Spanish, Topicalization, Left Dislocation, Scrambling. 


Languages: Spanish, French, German, Catalan. 


DESPITE THE RENEWAL of interest in historical syntax in recent years, some issues have not yet received a 
well-grounded explanation. The loss of topicalization, which took place in many Romance languages, including 
Spanish, is a notable issue of this sort. We know that, until the 16th century, topicalization as in (1) existed 
side by side with focalization as in (2):¹ 
(1) Y esas mismas le doy yo en la manera que puedo... (Carvajal 14, §2) 
and these ones 2SG.DAT give I in the manner that can.1SG... 


‘And these I give to you in the way I can.’ 


(2) Persecuciones grandes hubo David en su principio. (Del Pulgar 4.º, §5) 
persecutions big had David in his beginning 


‘David faced big persecutions in his beginnings.’ 


(3) Al philósopho pertenece probar y aún porfiar lo que dixere 
to the philosopher imputes.3sg prove and even strive for the that says.3sg 
mas al buen caballero no pertenece porfiar, sino defender. (Guevara 5, §2) 
but the good knight does not impute.3sg strive for, but protect 


‘To the philosopher it is imputed proving and striving for what he says, 
but to the good knight it is not imputed to argue, but to protect.’ 


¹ Examples followed by the author name are found in the primary references list, at the end of the text. The database is available at 
the following link: https://dl.dropboxusercontent.com/u/1097038/Corpus_Espanol_15-16.xlsx 


The usual account of these sentences is as follows. The fronted constituent in (1) is usually taken to be 
a topic because it contains an anaphoric element. If the element is a new piece of information, as the fronted 
DP in (2), it is an information focus. If the element is new and at the same time put in contrast with another 
element, it is considered to be a contrastive focus as in (3). 

The expression of information structure has changed in Modern Spanish in the contexts 
shown in (1) and (2), although (3) remains unaltered. The change can be seen in the Modern Spanish 
counterparts of (1)-(2), given in (4)-(5):² 


(4) Y esas mismas se las doy en la manera que puedo... 
and these ones se 3pl give.1sg I in the manner that can.1SG... 


(5) En su principio David tuvo persecuciones grandes. 
in his beginning David had persecutions big 


In other words, topicalization was replaced by dislocation, a construal with a resumptive pronoun 
co-referent with the topic; and focalization can no longer be expressed on the left side of the sentence, but only to its 
right.³ I intend to develop only the first of these two changes. 

The goal of this paper is to investigate this problem with the help of a database of marked syntactic 
constructions built from a corpus of letters written by authors born in the 15th and 16th centuries. This 
period was chosen because the change from Old to Modern Spanish is situated around then. The analysis 
considers existing hypotheses for the decay of topicalization in Spanish—which sometimes aim at 
explaining Romance languages generally—with regard to their empirical and theoretical adequacy. 

The text is organized as follows. In section 1, I present two previous analyses for the decay of 
topicalization, framed in the studies working with the syntax-information structure interface. Section 2 
presents the methodology for the corpus research. Section 3 includes the quantitative and qualitative results. 
Section 4 discusses some theoretical implications from the proposals and puts forward a third hypothesis. 
Finally, section 5 concludes the text. 


1. THE EXISTING HYPOTHESES AND THEIR PREDICTIONS. Two hypotheses have been put forward to 
explain the loss of topicalization, in favor of the use of left dislocation. I analyze them succinctly in the 
following, so as to pave the way for the corpus-based research. 

The first approach is functional in the sense that it correlates the use of informational values and 
syntactic positions, which I dub hypothesis A (cf. Danford 2002). Basically, this analysis conjoins two 
observations made in previous works: (i) that [+focus] elements are dislocated from argumental positions 
(Fontana 1993) and (ii) that resumption in Spanish comes along with specific elements (Suñer 1988). 
Notwithstanding that, in 15th/16th-century Spanish, the preverbal constituent could be mapped to Spec, IP 
as well, due to the loss of V-to-C movement (ULT 201vb in Danford 2002: 137):⁴ 


(6) E esto fazien por razon de los ricos omnes que estauan en Naples... 
and this did.3pl for reason of the rich men that were.3pl in N. 


‘And they were doing this because of the wealthy men that were in Nablus...’ 


² In (4) the indirect object le is represented with the se allomorph. Alternatively, it is dropped, which is the most frequent option. Both 
are strategies to overcome the Person-Case Constraint (Bonet 1991). 

³ ‘Focalization’ is intended to mean only informational focalization, and ‘dislocation’ is restricted to left dislocation. 

⁴ This kind of construction is typical of the 15th and 16th centuries, according to Danford (2002). See section 4 for further discussion. 
Regarding focus and contrast elements, Danford states (p. 124) that there would be no clear change from Old to Modern Spanish 
(contra Sitaridou 2011, a.o.). 


For Danford (2002), the nature of resumption (as in assumption (ii) above) continued to be the same; what 
changed was the expression of information structure, in that topic elements began to be mapped to 
adjoined, base-generated positions instead of in specifier positions created by movement. Consequently, 
the syntax must use a resumptive so that the object can receive its theta-role from the verb. 


(7) Hypothesis A: Syntax-to-information structure mapping 


a. [IP O V ... [VP ]] > O = focus; topic; contrast 
b. [IP O [IP V-cl ... [VP ]]] > O = topic 


I thus interpret that the deeper reason for change is a functional pressure for syntacticization of the 
distinction between different types of informational-structural values, as a consequence of adjunction. 
Although the author does not explore this problem explicitly, it becomes clear in Sitaridou (2011), where the 
loss of focalization is analyzed. 

On the other hand, a purely formal approach has been proposed for the loss of topicalization in Old 
Spanish (and in most Romance languages), which I dub hypothesis B (Mensching 2012). Framed in the 
recent minimalist theory of syntactic phases (Chomsky 2001, et seq.), according to which the sentence is 
divided into two domains that are visible to the interfaces, this proposal suggests that, in order to be 
displaced to the left periphery, a fronted topic constituent must first stop at the phase edge; the loss of 
an optional movement feature at v then impedes topicalization, and a Clitic Phrase becomes a phase head, 
replacing vP in its role (in this scheme and in the one in (15), subscripted numbers on the same element 
indicate its copies, where 1 stands for the final position):⁵ 


(8) Hypothesis B: Syntactic derivation of information structure 


a. [TP O1 [vP O2 ]] 
b. [CP O1 [ClP O2 cl [TP [vP O3 ]]]] 


In the simple representation above (devoid of other syntactic material, including the verb), O3 corresponds 
to the object in situ (postverbal) position; O2 to the intermediary position (previously scrambled), and O1 
to the fronted position. As a consequence, one predicts that, in the same period where there were fronted 
objects, we should also find scrambled objects. These latter elements can in fact be identified whenever there 
is a periphrasis with intervening material between the two verbs (example from Libro de Buen Amor I-191a 
in Batllori, Sanchez, and Suñer 1995 apud Mensching 2012: 22): 


(9) ... porque ella non avia las cartas rescebidas 
... because she not had the letters received.FEM.PL 


‘... because she had not received the letters’. 


⁵ Phases are identified by their maximal projections: the little verb Phrase (vP), including the verb and its arguments, and the 
Complementizer Phrase (CP), including all the functional categories of the sentence. The ‘edge’ corresponds to all material in their 
specifier positions or the head of these categories. The proviso “fronted topic” in Mensching’s (2012) rendering is due to a further 
assumption, according to which operator movement involved in focus-movement and wh-movement would be free to leave the vP 
through an ‘escape hatch’, without having to stop at Spec, vP. 


In order to perform corpus analysis on this theme, I identify the predictions that follow from each theory. For 
hypothesis A, only the functional pressure to distinguish information-structural values such as topic, focus, 
and contrast would be relevant for the change, and not the semantic value of the topic referent (considered 
as necessarily specific). Regarding hypothesis B the prediction is clear: we must find, at the same period, both 
constructions in (1) and (9). In fact, if the first step depicted in (8a) is a precondition for the second one, it 
is also possible that (9) exists longer than (1). 


2. THEORETICAL AND METHODOLOGICAL FRAMEWORK. Explanations for the availability of 
marked-topic constructions have been developed in different domains: syntax, pragmatics and discourse analysis (cf. 
Hidalgo Downing 2003 for a summary). Much of the syntactic work has been developed in the generative 
framework (as can be gathered from the above presentation), which takes into consideration some pragmatic 
aspects, especially to make a distinction between topic and focus. For the goals of this study, I also assume 
this theory and methodology, because it allows plenty of options for a specific explanation of the 
syntax-information structure interface. 

The methodology for the identification of information-structural notions follows the traditional view, 
in that topics correspond to old information, whereas foci correspond to new information. Contrasts (i.e. 
contrastive foci) consist of elements comprising the exhaustive selection of one of the members of a list of 
alternatives.⁶ 
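
The coding criteria just stated can be read as a small decision rule. The Python sketch below is only an illustration under stated assumptions: the function name `label_fronted_constituent` and its two boolean parameters are hypothetical, introduced here for exposition, and the rule leaves aside accessible elements, which require a separate coding decision.

```python
def label_fronted_constituent(is_old_information: bool, is_contrasted: bool) -> str:
    """Hypothetical helper: assign an information-structural label.

    Assumption: each fronted constituent has already been coded for
    old vs. new information and for contrast with a listed alternative.
    """
    if is_old_information:
        # Old information is a topic; contrastive topics stay in this class.
        return "topic"
    if is_contrasted:
        # New information selected from a list of alternatives.
        return "contrast"
    # New information without contrast: information focus.
    return "focus"


for old, contrasted in [(True, False), (False, False), (False, True)]:
    print(old, contrasted, label_fronted_constituent(old, contrasted))
```

Since the contrast category is restricted to contrastive foci, the sketch keeps an old-information constituent under "topic" even when it is contrasted.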

This methodology is worth pursuing provided that the coding decisions are made explicit. A problematic 
aspect is the coding of accessible elements, which form a heterogeneous class. The decision 
taken was to divide accessible items into two subclasses: those that imply the creation of a complex 
discourse model, including anchored and contextually accessible entities, were taken as new information, thus 
marking a focus; whereas those that can be related to a partially-ordered set in the discourse model of 
the listener were taken as old information (cf. Birner & Ward 1998). Note that the use of a model based on 
the identification of the questions guiding the discourse at each step implies making very specific 
assumptions that are sometimes unclear for historical texts, which is a reason for always combining that approach 
with the traditional one. 

⁶ Two observations about contrast. First, this category does not include contrastive topics. Second, it does include cases of verum focus, 
a category proposed for Modern Spanish (Leonetti & Escandell-Vidal 2007) but whose status in Old Spanish is unclear. Verum 
focus expresses emphasis on the whole predicate and is possibly found in confirmative utterances such as the following one: 
(i) Contad, muy reuerendo señor, vuestros dias antiguos, 
count, much reverend sir, your days old 
y los años de vuestra vida considerad. (Del Pulgar 3.º, §2) 
and the years of your life think.over.imp.2pl 
‘Count, my reverend sir, your old days, and consider the years of your life.’ 

The corpus comprises three compilations of letters for each century (15th and 16th), collected from 
Biblioteca Cervantes Virtual. The database was made by manual search, including direct or indirect objects 
before a finite verb or a non-finite verb (infinitive or participle in a compound tense), thus excluding 
constituents that cannot be resumed by a clitic, such as oblique arguments (cf. (10a)). Only elements with 
connectivity between the peripheral element and a sentence-internal position were selected, thus excluding 
hanging topics and hanging topic left dislocations, the latter type being illustrated in (10b): 


(10) Non-selected data 
a. Del conde me avise vuestra merced (Carvajal 23, §3) 
of.the count 1sg warn.sbjv Your Mercy 
‘As for the count, may Your Mercy warn me.’ 
b. ...lo que es el pleito nunca canso con él a nadie (Carvajal 6, §6) 
...what is the pledge never disturb.1sg with it to nobody 
‘...what (is related to) the pledge, I never disturb nobody with it.’ 


The reduced number of selected sentences allows a detailed comparison between cases of dislocation 
and topicalization, which are both possible with direct and indirect objects. For the classification, I took 
into account the construction type (topicalization, focalization, dislocation, contrast), the grammatical 
function of the peripheral element (direct or indirect object) and the semantic type of dislocated object 
(definite or generic DP). 
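
The three classification dimensions can be pictured as one record per database entry. The following Python sketch is a hypothetical rendering, not the layout of the actual database: the class and field names are assumptions made for illustration, and only the category values named in the paragraph above are taken from the text.

```python
from dataclasses import dataclass
from enum import Enum


class Construction(Enum):
    TOPICALIZATION = "topicalization"
    FOCALIZATION = "focalization"
    DISLOCATION = "dislocation"
    CONTRAST = "contrast"


class GrammaticalFunction(Enum):
    DIRECT_OBJECT = "direct object"
    INDIRECT_OBJECT = "indirect object"


class SemanticType(Enum):
    DEFINITE = "definite DP"
    GENERIC = "generic DP"


@dataclass(frozen=True)
class AnnotatedSentence:
    """Hypothetical record: one fronted object and its three codings."""
    source: str  # citation of the letter, e.g. "Guevara 5, §2"
    construction: Construction
    function: GrammaticalFunction
    semantic_type: SemanticType


# Illustrative entry, coded after example (11a): a dislocated generic
# constituent resumed by a dative clitic.
entry = AnnotatedSentence(
    source="Guevara 5, §2",
    construction=Construction.DISLOCATION,
    function=GrammaticalFunction.INDIRECT_OBJECT,
    semantic_type=SemanticType.GENERIC,
)
print(entry.construction.value, entry.function.value, entry.semantic_type.value)
```

Keeping the three dimensions as separate closed sets makes it straightforward to cross-tabulate, for instance, construction type against semantic type.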


3. THE EMPIRICAL CONTRIBUTION. The database collected according to the methodology depicted 
above included almost 260 sentences, in which I have found the marked structures shown in Table 1: 


Table 1. Summary of results 


Construction      15th century   16th century 
Contrast                     9             16 
Focalization                57             42 
Dislocation                 35             32 
Topicalization              14             41 
Scrambling                   2              1 

TOTAL                      124            132 


The results from Table 1 are given in absolute numbers because no comparison is made between preverbal 
and postverbal objects, and because the listed configurations are not mutually exclusive. However, it is 
worth comparing dislocation and topicalization, which form a minimal pair. Contrary to initial expectations, the 
share of non-resumed topics increases from 28% to 56%. 
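
The two percentages can be checked with a short calculation. The Python sketch below is a reading aid under stated assumptions: it takes the share of non-resumed topics to be topicalization divided by topicalization plus dislocation, uses the dislocation counts of Table 1 (35 and 32), and assumes topicalization counts of 14 and 41, which are the values consistent with the reported 28% and 56%. The function name is hypothetical.

```python
def non_resumed_share(topicalization: int, dislocation: int) -> float:
    """Share of marked topics that lack a resumptive clitic (assumed definition)."""
    return topicalization / (topicalization + dislocation)


# Assumed counts: dislocation from Table 1; topicalization inferred
# from the percentages reported in the text.
counts = {
    "15th": {"topicalization": 14, "dislocation": 35},
    "16th": {"topicalization": 41, "dislocation": 32},
}

for century, c in counts.items():
    share = non_resumed_share(c["topicalization"], c["dislocation"])
    print(f"{century} century: {share:.1%}")
# prints:
# 15th century: 28.6%
# 16th century: 56.2%
```

Truncating these values to whole percentages gives the 28% and 56% cited above.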

For the analysis of hypothesis A, I have found three examples from Guevara that suggest that changes 
in the semantic type of topic play a role in the frequency of resumption: 



(11) Dislocation of generic constituents 
a. Al caballero que es animoso, esforcado y valeroso, 
to.the knight that is brave, zealous, and valiant, 
nunca se le ha de encender la colera... (Guevara 5, §2) 
never se 3sg.dat have of turn.on the fury... 


‘The brave and zealous knight never shall let the fury appear to him.’ 


b. y moneda no consentian esculpirla 
... and coins not allow.past.3pl mint+3sg.fem 
sino al que hubiese vencido alguna famosa batalla... (Guevara 3, §2) 
but to.the that had.past.3sg won.pp some famous battle 
‘They did not allow anyone to mint coins but those that have won some famous 
battle.’ 


These examples include bare nouns in the left periphery with a clear [-specific] meaning, thus going against 
one of the assumptions guiding hypothesis A, according to which Suñer’s (1988) model of resumption 
continued unaltered from Old into Modern Spanish. However, these data do not fully invalidate Danford’s 
hypothesis that the loss of V-to-C is connected to the change. 

Consider now the analysis of hypothesis B. Although the number of examples representative of 
scrambling is rather small, they become valuable in light of a study on past participles. In fact, 
exactly at the turn of the 16th century, which Batllori (1992: 104) identifies as the point of loss of 
intermediate DPs in sequences involving the bona fide auxiliaries ter, ser, and estar plus a past participle, I 
have also observed the last cases of scrambling with an auxiliary-like verb followed by an infinitive, which 
were the only cases found in these texts: 


(12) Scrambling 
a. no quiere otra cosa decir sino que todos le sirvan de voluntad 
... not want another thing to.say but that all 3sg.dat serve of will 
y él use con todos de liberalidad (Guevara 2, §12) 
and he uses with everybody of generosity 
‘...he does not want to say anything else but that all serve him 
willingly and that he may be generous with everybody.’ 
b. y podian todos dellas comer (Guevara 4, §10) 
... and could everybody of.them eat 
‘...and everybody could eat of them.’ 
c. ...y porque por carta no se puede esto especificar, basta esto. (Avila IV, §3) 
... and because by letter not can this specify, suffices this 
‘...and because by letter one cannot specify this, only this is enough.’ 


This kind of example is only mentioned in a footnote in Mensching (2012) because it involves assuming a 
structural analysis for ‘restructuring’ verbs (modal verbs in the examples above). I assume a reduced (vP) 
structure for the complement of these verbs (cf. Andrade 2010), thus making them similar to participles in 
compound tenses, except for the fact that the ‘restructuring’ verb merges in the lexical head V, instead of in 
the functional head T, as is the case with bona fide auxiliaries.⁷ 


⁷ This analysis also has the advantage of dispensing with a degrammaticalization hypothesis for the complement of ‘restructuring’ 
verbs (cf. De Andrade & Fischer To appear). 

Since Avila is the last author to use scrambling, one would suppose, considering hypothesis B, that 
topicalization would disappear at the same time, or even before that period, but this is not what we find in 
data from later authors: 


(13) Topicalization 
a. Todo esto miraba yo estando dos pasos 
all this looked I being two steps 
dentro de aquella tercera pieza (De la Puente 14, §24) 
inside of that third room 
‘I looked at all this while I was two feet inside that third room.’ 


b. y toda la tarde entera gastamos en hablar 
and all the afternoon entire spent.1pl in talk 
de las santas Recoletas de Medina con un gusto grandisimo. (Carvajal 21, §2) 
of the saints secluded.fem of Medina with a taste very.great 
‘... and the entire afternoon we spent in talking about the nuns from Recoleta de 
Medina with great enjoyment.’ 


In summary, the loss of scrambling takes place at the beginning of the 16th century, and topicalization 
still continued in use afterwards. This makes sense, because scrambling is crosslinguistically more marked 
than topicalization. This seems to be important evidence against the prediction of hypothesis B. 


4. DISCUSSION. The data discussed above suggest that an explanation for the loss of topicalization in 
Spanish is still in need of further exploration. However, considering that our corpus evidence may be 
deemed too scarce, I discuss some theoretical problems that seem to compromise these proposals. 

The problem with hypothesis A is that it supposes that information structure tends to be expressed in 
grammar in a maximally clear way. This poses a problem in the case of V2 languages, where the constituent 
in first sentential position (P1) may correspond to either a topic or a focus (or even a contrast, not illustrated 
below; German examples from Hetland 2003:15): 


(14) Kennst du jemanden in Bristol? 
know.2sg you anybody in Bristol 
‘Do you know anybody in Bristol?’ 


a. Meine Eltern kamen aus Bristol. (P1 = focus) 
my parents come.past.3pl from B. 
b. Aus Bristol kamen meine Eltern. (P1 = topic) 
from B. come.past.3pl my parents 
‘My parents came from Bristol.’ 


Most Germanic languages have kept their V2 status for centuries, a fact suggesting that the trigger for 
change cannot be just the need to map syntactic positions with information-structural values. Second, this 
hypothesis assumes either that the difference between specifiers and adjuncts plays a role in language 
acquisition—which does not seem realistic—or that learners read only the word-order string (cf. Danford 
2002 and Sitaridou 2011), although many studies have shown that semantic and prosodic information guide 
this process. 

Hypothesis B seems to make a correct assumption: that information structure works with a spectrum 
of variation offered by the grammar, and not the other way around. In other words, the reason for the 
change is to be found in independent structural reasons. However, the replacement of the role once played 
by the vP in Old Spanish by a Clitic Phrase is completely unexpected, and especially so under the traditional 
version of phase theory, since that projection is higher than TP. 

A third proposal for the loss of topicalization can be developed, though, integrating aspects of the two 
mentioned hypotheses, namely the loss of V-to-C and phase theory. However, proponents of a V2-like 
grammar for Medieval Romance have presented the loss of XP-V order as being itself the trigger for the loss 
of V-to-C (Adams 1987:26, for French). In this approach, the replacement of topicalization by dislocation 
comes as a corollary from two basic assumptions: (i) that marked topics are merged in the CP domain; and 
(ii) that left-dislocated topics are base-generated in their surface position (Hirschbühler 1975, Cinque 1990, 
a.o.). 

Although this theme has already been discussed in numerous ways, I would like to propose that the loss of XP-V order, instead 
of being the trigger for the loss of verb movement, is its consequence. To do so, I assume 
the version of phase theory dubbed “phase extension” (Den Dikken 2007), according to which the syntactic 
phase depends on the reach of the displacement of a given head in the structure. This means that in the 
period when V-to-C movement took place, the phase, normally limited to vP, would be extended to CP, 
allowing the movement of other elements to the Spec, CP position. 


(15) Hypothesis C: Information structure is limited by the syntax 


a. [CP O1 V [TP ... [vP O2 ]]] 
b. [CP O1 [TP O2 cl1 V [vP cl2 ]]] 


For some reason—possibly due to a very general drift in the Romance area—V-to-C gets restricted to 
V-to-T. In this new scenario, the object on its way up to C finds a ‘barrier’ in TP. In this sense, the only way 
to have a marked topic construction would be to base-generate the topic in the CP domain and insert a coindexed 
clitic.⁸ 

A crucial difference between this proposal and the alternative ones is that the loss of V-to-C 
takes place only by the end of the 16th century. Therefore, fluctuations in the usage of marked syntactic 
constructions may be attributable either to individual preferences or to the activation of specific projections 
in the left periphery. 

During Old Spanish, the occurrence of resumptive pronouns seems to depend, in a cartographical 
analysis, on whether the topic is in a higher clausal position receiving base-generated constituents. The
example below illustrates the context of the sentence in (11b), in which there was resumption. The topic 


® The difference between adjunction and specifier is relevant in the Government and Binding framework because the topic/focus 
element would govern its trace, provided that there is no intervening element (such as a subject). A minimalist proposal can also 
dispense with Torrego’s (1988) theory of ‘big DP’ (where the DP and a pronominal copy are generated together). Consequently,
it is possible to maintain the rule of predication in Chomsky (1977), that requires coindexation between the external topic and a 
clause-internal operator. 


moneda (‘coin’) is contrasted to edificios (‘buildings’): 


(16) En dos cosas trabajaban los romanos de dexar y perpetuar sus memorias;
     in two things worked the romans of leave and perpetuate their memories
     es, a saber, en edificios que hacian y en monedas que esculpian...
     is.3sg, to know, in buildings that made.3pl and in coins that sculpted.3pl
     (Guevara 3, §2)
     ‘Romans worked in two things so as to perpetuate their legacy:
     namely, in the buildings they made and in the coins they minted.’


By the 15th century, Spec,KP, a position of the left periphery for (moved) contrastive topics, became
available whereas Spec,TP was still available as a position for contrastive foci. These two factors helped to
increase the number of non-resumed preverbal objects. Finally, an independent grammatical change affects
bare singular nouns, which may have an existential or a generic reading (cf. Cohen & Erteschik-Shir 2002).
In Modern Spanish, these elements came to have only a generic reading, being
compatible only with a topic position. I follow Casielles-Suarez (2004) in assuming that non-resumption is apparent
in this case and derives from a null partitive clitic, parallel to what is expressed by an overt clitic in other
Romance languages such as Catalan (examples from Rivero 1980: 365 and Casielles-Suarez 2004: 81):


(17) a. Dinero dicen que no tiene. 
money say.3pl that not has.3sg
b. (De) diners dicen que no en té. 
of money say that not part has.3sg 
‘They say he does not have (any) money.’ 


Therefore, in Modern Spanish, topics cannot move to CP, but can only be base-generated in the left
periphery, thus necessarily involving the usage of a resumptive pronoun - cf. (18a). On the other hand,
Focus Fronting remained unchanged and involves movement to Spec,TP (cf. Camacho-Taboada & 
Jiménez-Fernandez 2014), which continued being available in this grammar - cf. (18b). 


(18) a. Esto (lo) digo que es bueno. (O = Topic) OK OSp / * MSp
this (3sg) say.1sg that is.3sg good
‘I say that this is good.’
b. Esto no comprendo. (O = Contrast) OK OSp / OK MSp
this not know.1sg
‘I do not know this.’


This proposal may be further strengthened by corpus data from different periods of Spanish, and it may 
indeed replicate similar results for French (cf. de Andrade 2018). 


5. CONCLUSION. This study provided a review of different approaches to the decay of topicalization from 
Old Spanish into Modern Spanish. With the help of a small corpus of letters written in the 15th and 16th
centuries, which is usually taken as the period of transition between Old and Modern Spanish, I have 
discussed the empirical and theoretical adequacy of two hypotheses put forward for the decay of 
topicalization in Spanish. After observing problems in the two proposals, I consider a third hypothesis, 




which is partially based on insights from the two previous works and which capitalizes on the loss of V-to-C
as the triggering factor for the loss of topicalization (instead of the other way around, as many works
proposing a V2-like grammar for Romance languages have suggested). Two apparent pieces of 
counterevidence suggested from the database were discussed and shown to be not prima facie incompatible 
with the new hypothesis. This proposal, although still in need of further testing, may be applied to the 
change in other Romance languages. 


ACKNOWLEDGMENTS. I would like to thank Fapesp for financial support from project number 
2013/24637-8, the audience of LACUS 41 for the interest in this research and Marie Labelle, Paul
Hirschbühler and Gustavo Beritognolo for discussion of related issues. All remaining errors are my
responsibility. 


REFERENCES 


A. Primary Sources (Preceded by author abbreviated name and year of birth) 


AVILA (1500) AVILA, JUAN DE. 1999. Epistolario Espiritual. Alicante: Bib. Virtual Miguel de Cervantes. 
http://www.cervantesvirtual.com/nd/ark:/59851/bmcgq6s7. 

CARVAJAL (1566) CARVAJAL Y MENDOZA, LUISA. 1999. Epistolario de Luisa de Carvajal y Mendoza.
Alicante: Bib. Virtual Miguel de Cervantes. 
http://www.cervantesvirtual.com/nd/ark:/59851/bmc804z8. 

DE LA PUENTE (1554) DE LA PUENTE, LUIS. 1999. Epistolario. Alicante: Bib. Virtual Miguel de 
Cervantes. http://www.cervantesvirtual.com/nd/ark:/59851/bmchm545. 

DEL PULGAR (1436) DEL PULGAR, FERNANDO. 1999. Letras. Alicante: Bib. Virtual Miguel de 
Cervantes. http://www.cervantesvirtual.com/nd/ark:/59851/bmcpn918. 

GUEVARA (1480) GUEVARA, ANTONIO DE. 1999. Libro Primero de las Epistolas Familiares. Alicante:
Bib. Virtual Miguel de Cervantes. http://www.cervantesvirtual.com/nd/ark:/59851/bmcqz258. 

SAN PEDRO (1437) SAN PEDRO, DIEGO DE. 1999. Carcel de Amor. Edited by Pablo de Colonia, Juan
Pegnitzer, Magno Herbst and Tomas Glockner. Alicante: Bib. Virtual Miguel de Cervantes.
http://www.cervantesvirtual.com/nd/ark:/59851/bmc222s8.


B. Secondary Sources 


ADAMS, MARIANNE. 1987. From Old French to the theory of Pro-drop. Natural language and linguistic 
theory 5, 1-32. 

ANDRADE, AROLDO. 2010. A subida de clíticos em português: um estudo sobre a variedade europeia dos
séculos XVI a XX. PhD diss., Universidade Estadual de Campinas.

BATLLORI, MONTSE. 1992. Preliminary remarks on Old Spanish auxiliaries: haber, ser and estar. Catalan
Working Papers in Linguistics. 1992: 87-112. 

BENINCA, PAOLA. 2004. The left periphery of Medieval Romance. Studi linguistici e filologici online 2:
243-297. 




BIRNER, BETTY, & GREGORY WARD. 1998. Information status and noncanonical word order in
English. Amsterdam/Philadelphia, PA: John Benjamins. 

BONET, EULALIA. 1991. Morphology after syntax: Pronominal clitics in Romance. PhD diss.,
Massachusetts Institute of Technology.

CAMACHO-TABOADA, VICTORIA, & ANGEL L. JIMENEZ-FERNANDEZ. 2014. Focus Fronting and Root
Phenomena in Spanish and English. In Language Use and Linguistic Structure: Proceedings of the
Olomouc Linguistics Symposium, ed. by Joseph Emonds and Markéta Janebová, 47-60. Olomouc:
Palacký University Press.

CASIELLES-SUAREZ, EUGENIA. 2004. The syntax-information structure interface: Evidence from Spanish
and English. New York: Routledge. 

CHOMSKY, NOAM. 1977. On wh-movement. In Formal syntax, ed. by Peter W. Culicover, Thomas 
Wasow, and Adrian Akmajian, pp. 71-132. New York: Academic Press. 

---. 2001. Derivation by phase. In Ken Hale: A life in language, ed. by Michael Kenstowicz, 1-52. Cambridge,
MA: MIT Press. 

CINQUE, GUGLIELMO. 1990. Types of A' dependencies. Cambridge: MIT Press. 

COHEN, ARIEL, & NOMI ERTESCHIK-SHIR. 2002. Topic, focus, and the interpretation of bare plurals.
Natural language semantics 10: 125-165.

DANFORD, RICHARD K. 2002. Preverbal accusatives, pronominal reduplication, and information
packaging: A diachronic analysis of Spanish. PhD diss., Ohio State University. 

DE ANDRADE, AROLDO, & SUSANN FISCHER. To appear. Degrammaticalisation indices in Iberian 
complex predicates? In From composite predicates to verbal periphrases in Romance languages, ed. by
Mar Garachana, Sandra Montserrat, and Claus D. Pusch. Amsterdam/Philadelphia: John Benjamins. 

DE ANDRADE, AROLDO. 2018. Aboutness topics in Old and Middle French: A corpus-based study on
the fate of V2. Canadian Journal of Linguistics 63: 194-220.

DEN DIKKEN, MARCEL. 2007. Phase extension: Contours of a theory of the role of head movement in 
phrasal extraction. Theoretical linguistics 33: 1-42. 

FONTANA, JOSEP MARIA. 1993. Phrase structure and the syntax of clitics in the history of Spanish. PhD 
diss., University of Pennsylvania. 

HETLAND, JORUNN. 2003. Contrast, the fall-rise accent, and information focus. In Structures of focus and
grammatical relations, ed. by Jorunn Hetland and Valeria Molnar, 1-40. Tübingen: Max Niemeyer.

HIDALGO DOWNING, RAQUEL. 2003. La tematización en el español hablado: estudio discursivo sobre el
español peninsular. Madrid: Gredos.

HIRSCHBÜHLER, PAUL. 1975. On the source of lefthand NPs in French. Linguistic Inquiry 6: 155-165.

LEONETTI, MANUEL, & VICTORIA ESCANDELL-VIDAL. 2007. Fronting and verum focus in Spanish. In
Focus and background in Romance Languages, ed. by Andreas Dufter and Daniel Jacob, 155-204.
Amsterdam: John Benjamins.

MENSCHING, GUIDO. 2012. Parameters in Old Romance word order. In Parameter Theory and Linguistic 
Change, ed. by Charlotte Galves, Sonia Cyrino, Ruth Lopes, Filomena Sandalo and Juanito Avelar, pp. 
21-42. Oxford: Oxford University Press. 

RIVERO, MARIA LUISA. 1980. On left-dislocation and topicalization in Spanish. Linguistic inquiry 
11:363-393. 

SITARIDOU, IOANNA. 2011. Word order and information structure in Old Spanish. Catalan journal of 
linguistics 10: 159-184. 

SUÑER, MARGARITA. 1988. The role of agreement in clitic-doubled constructions. Natural language and
linguistic theory 6:391-434.

TORREGO, ESTHER. 1988. Pronouns and determiners: A DP analysis of Spanish nominals. Ms., 
University of Massachusetts, Boston. 


This article was first published at lacus.weebly.com. 


LACUS Forum 41:1.43-51 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 





THE PICNIC LANGUAGE 
CHRISTINA KEPPIE 


Western Washington University 


Abstract: In 1982, New Brunswick playwright Rino Morin Rossignol published the one-act play Le pique-nique.
This play has been shown to accurately represent the ideological discourses in New Brunswick’s three
French Acadian regions. The present article takes a brief look into the morphosyntactic accuracy of the play
by examining the presence or absence of two flexions common in Acadian dialect studies: je...ons for first
person plural and i(ls)...ont for third person plural. Previous research points to a cessation of use of these
flexions in New Brunswick, but their occurrences in the lines of the Chorus members from the North-East and the
South-East in Le pique-nique suggest that these morphosyntactic features of Acadian French may still be in use in
New Brunswick. This literary analysis demonstrates how literary works can support, confirm, and
complement sociolinguistic fieldwork in cultural and linguistic research.


Keywords: Acadia, Acadian, identity, New Brunswick, morphosyntax, desinence, Rino Morin Rossignol 


Languages: English, French 


LONERGAN (2013) PAYS HOMAGE TO THE NUMEROUS ACADIAN ARTISTS who have helped 
propel Acadia into and through modernity in Acadie 72: Naissance de la modernité acadienne. He 
justifiably discusses how these artists have assured Acadia’s cultural survival and recognition in the 
globalizing Francophonie by representing their neighbor, the average New Brunswick Acadian, in the 
creation and publication of their works through local publishing houses. However, one shortcoming of 
Acadie 72 is that, with the exception of Antonine Maillet’s famous 1971 series of monologues, La Sagouine,
Lonergan devotes very little effort to addressing how the use of the Acadian dialect(s) in artistic writing has
reinforced or reflected actual Acadian identity. La Sagouine is acknowledged as the first artistic evidence of 
cultural tension/difference in French New Brunswick, published during a time when Acadia and the rest of 
French Canada were defined primarily in their opposing relation to the dominant English-speaking
community, known as the modernizing period (Heller & Labrie 2003, Keppie 2011). At present, New 
Brunswick Acadians have generally adopted a more embracing and positive discourse, having realized that 
globalizing social, economic, and cultural advancement/survival is not a likely outcome of the modernizing 
period’s more nationalistic platform. With 1981’s Bill 88, a linguistic community equality law adopted by 
the provincial Legislative Assembly, the globalizing period began to take root and grow throughout French 
Canada (Heller & Labrie 2003). Within New Brunswick Acadia, this ideological period was evidenced by
acknowledging and promoting the economic benefits of bilingualism and cooperation between the English 
and French linguistic communities. 


1. LE PIQUE-NIQUE. At around the very time that the globalizing period began emerging in New
Brunswick, Le pique-nique (Rossignol 1982) was published by Les Éditions Perce-Neige. It is a one-act satire
composed of dialogues regarding cultural and social differences in Acadia and the communication problems
that arise from a lack of respect for these differences: “En fait, j’écrivais des dialogues... Le pique-nique
est une pièce sur les différences. Sur les problèmes de communication qu’engendre le non-respect des
différences” (Rossignol 2014). In research on New Brunswick Acadian identity, debates surrounding the
diverse meanings and understandings of the terms Acadie/Acadien ‘Acadia/Acadian’ as well as Acadian
membership and acadianité have been greatly mediated by the community elite (intellectuals, politicians,
historians), giving rise to l’identité collective acadienne ‘collective Acadian identity’ (Thériault 2007). Until
lately, very little attention has been given to actually discussing what is meant by hybridity in Acadian 
identity, as perceived by the general Acadian population through an anthropological approach (Keppie 
2013a and 2013b). In New Brunswick, this general population is communally viewed as comprising three 
culturally-distinct regions: the North-East, the South-East, and the North-West. 

These regions also stand as the three ‘members’ of Rossignol’s chorus triumvirate (le Chœur),
representing the Vox Populi ‘voice of public opinion’ of French New Brunswick, the image of public
opinion. In Le pique-nique, the role of the Chorus echoes that of traditional Greek theater, yet while
frequently acting as a homogeneous group in their dialogues with the other characters, the Chorus also 
splinters into its three distinct members, bickering and bantering with each other as they discuss, among 
other topics, the meaning of Acadia and what it means to be Acadian. 


2. LE PIQUE-NIQUE’S ACADIANITÉ. Recent studies have inarguably shown that each French-speaking
region of New Brunswick interprets acadianité ‘Acadianness’ ideologically and, hence, differently, resulting
in an overall hybrid identity within New Brunswick alone. Acadians of the North-East understand
twenty-first century acadianité in a very traditional sense, considering ancestral ties to the 1755 Acadian Expulsion
as the central criterion of Acadian membership or identity. The Acadians and Brayons of the North-West
have been shown to maintain the modernizing perspective that branded French Canada during the 1960s 
and 1970s, a perspective that considered acadianité in geographical and linguistic opposition to English- 
speaking New Brunswick. The South-East, however, due to its more urban and multilingual composition, 
is characterized by having a very globalized opinion in regard to Acadian identity which, as previously 
stated, took root in French New Brunswick in the 1980s and has since grown in popularity throughout the 
province to the point that it collectively defines French New Brunswick (Keppie 2011, 2013a). As a whole, 
there is no longer any dispute that New Brunswick Acadia, and the South-East in particular, considers the
economic advancement of the whole Acadian community as a higher priority than the necessity of language 
maintenance, proof of Acadian lineage, and an officially (politically) defined Acadian territory. New 
Brunswick Acadians have collectively warmed to the continuous linguistic contact between English and 
French speakers and now advocate cross-linguistic community membership, acknowledging the increasing 
economic benefits for Acadia on the global scale that bilingualism has afforded (LeBlanc 2009). 

Although the play was written in 1982, the different perspectives, values, and beliefs of the three Chorus members in fact
complement the regional meaning systems that currently characterize French New Brunswick in the early
twenty-first century (Keppie, to appear). Acadianité is discussed at length throughout the satire, centering
on several major themes, all of which play an important role in the ideological positions discussed above:
territory, language, lineage, cultural centers and regional membership. Le pique-nique is, quite frankly, a
cultural work of art in that it is a current mirror into the New Brunswick Acadian identity. Upon
communicating with the playwright, I learned of the joyful ease with which Rossignol put pen to paper and
wrote the play in a single night:


J’étais seul chez moi, et j’ai griffonné sur un papier: ‘L’Acadie se marie’ en le prononçant dans ma tête
avec l’accent Acadien - ‘L’Acadie se maaarie’. Et j’ai tout de suite pensé et écrit: ‘L’Acadie s’met à rire’,
prononcé ‘s’m’aaa rire’ qui sonne plus comme l’accent brayon du Madawaska. Et ça m’a fait rire. Et
sans trop savoir ni pourquoi, ni comment, j’ai écrit: ‘Ladies and gentlemen, Lord Durham’. Et là, tout a
déboulé de manière naturelle. C’était comme une écriture automatique. Les phrases sont apparues, les
unes après les autres. Et j’écrivais tout ça en riant aux éclats. Et les personnages ont suivi, tout aussi
naturellement. Je ne me suis pas posé de questions. Tout surgissait sur ma feuille au bon moment
(Rossignol 2014).¹


3. THE LANGUAGE OF LE PIQUE-NIQUE. My communication with Rossignol was born out of curiosity
about his writing process. If he was able to depict actual public opinion through le Chœur so well and
in a single night, could the play be linguistically representative of New Brunswick Acadia as well? Does Le
pique-nique showcase linguistic features known to New Brunswick Acadia? When I asked Rossignol about
his attention to the actual language used in Le pique-nique, he responded that for several months after that
initial night of hilarious writing, he carried a typed copy of the play everywhere he went, constantly adding,
correcting and refining so that each character reflected the accent he envisioned:


Pour le Chœur je me suis beaucoup inspiré de mes amis Brayons du Nord-ouest et Acadien du Nord-est
et Sud-est qui venaient à Fredericton où je passais beaucoup de temps avec eux au River Room de
l’hôtel Beaverbrook, à lire, relire, commenter, parler politique, rire et boire de la bière, et bien des gens
ont contribué par leurs commentaires à me faire préciser les idiosyncrasies locales et les accents régionaux
en particulier ... En ce qui a trait à Paul, c’est mes mots, ma voix, mon ton, mon accent brayon ... Parmi
les différences qui sont mises en scène dans la pièce, les accents jouent un rôle particulier. Ils jettent une
couleur sur les propos des personnages, mais ils ne sont pas le but du propos, comme tel ...
Toutefois, l’accent constitue une différence, et comme toute différence, la question de l’accent peut
devenir aussi une cause de discrimination (Rossignol 2014).²


Even though Rossignol spoke of writing the play with Acadian and Brayon accents in mind, a 
phonological analysis of Le pique-nique was not my intention when I communicated with the playwright. I 
was interested more in the Acadian linguistic features evident on paper which were brought to my attention 
by my students as we spent several weeks reading and analyzing the play in an Acadian culture class at 
Western Washington University. The play's lexicon and regional expressions were definitely studied at 
length but what the students found even more curious (and often difficult to wrap their heads around) were 
the morphosyntactic differences between the French they had been learning as undergraduates (Standard 


¹ Translation: ‘I was home alone and I scribbled on a piece of paper: “L’Acadie se marie”, pronouncing it in my head with the Acadian
accent - “L’Acadie se maaarie”. And right away I wrote: “L’Acadie s’met à rire”, pronounced “s’m’aaa rire”, which sounds more like
the Brayon accent of Madawaska. And that made me laugh. And without really knowing why, or how, I wrote: “Ladies and
gentlemen, Lord Durham”. And from there, everything took off, quite naturally. It was like automatic writing. Sentences appeared,
one after the other. And I laughed the whole time I wrote. The characters followed, also seamlessly. I didn’t ask myself any questions.
It all flowed out on paper at the right moment.’

² Translation: ‘For the Chorus, I was very much inspired by my Brayon friends from the North-West and my Acadian friends from
the North-East and South-East who used to come to Fredericton. I would spend a lot of time with them in the River Room of the 
Beaverbrook Hotel reading, rereading, commenting, talking politics, laughing and drinking beer, and so through their comments, 
many people helped me refine the local idiosyncrasies and the regional accents in particular ... When it comes to Paul, those are 
my words, my voice, my tone, my Brayon accent ... Among the differences which are addressed in the play, the accents play a 
particular role. They add richness to the words of the characters, but they are not the purpose of their words, as such ... However, 
accent constitutes a difference and like all differences, the question of accent can also be a cause of discrimination.’ 




French - SF) and the structure of Acadian French (AF) they were encountering in Le pique-nique. Therefore,
in a sense, my students inspired this study of the Acadian morphosyntactic features present in Rossignol’s
play, as I was able to explain not only the relationship between regional identity and discourse (on which I have
focused in past research) but also the relationship between regional identity and linguistic features,
making for a very well-rounded cultural analysis of acadianité.


3.1. AN OVERVIEW OF ACADIAN MORPHOSYNTACTIC FEATURES. The present study addresses the presence or
absence of the most commonly studied morphosyntactic features in the Acadian language found within
Le pique-nique: the traditional flexions known as je...ons and ils...ont, whose uses are both determined by
the socioeconomic factors of education and age.


3.1.1. JE ... ONS. With its origins in the northern parlers d’oïl (Flikeid & Péronnet 1989), this first-person
plural flexion is widely believed to have all but disappeared from the two eastern New Brunswick Acadian
French varieties (henceforth referred to as FANENB and FASENB), while still persisting in other Acadian
varieties, such as in Newfoundland and Nova Scotia (Beaulieu & Cichocki 2008; King, Nadasdi, & Butler
2004; Beaulieu & Balcom 1998; Flikeid & Péronnet 1989). This traditional vernacular variant remains highly
stigmatized, especially in New Brunswick, where Acadians have received higher education and institutional
support. Because this stigma and institutional support together favor the
personal pronoun on with the third-person singular conjugation in place of je ... ons or even nous
... ons, the use of this vernacular variant is not a conscious decision (Flikeid & Péronnet 1989).


3.1.2. I(L) ... ONT. The flexion -ont for third-person plural has also been documented as a common feature
of XII-century parlers d’oïl from the Central West, as opposed to je ... ons, which originated more in the
North (Flikeid & Péronnet 1989). This feature has maintained its popularity over the centuries and is 
currently observed in all AF varieties (Weismath 2006). It also presents an advantage in oral communication 
by demonstrating a larger optimization between third-person singular and third-person plural radicals. The 
many constraints on its use, however, make its full use challenging for non-Acadians in comparison with 
French from France (FR). 


Table 1. Third-person plural variation in FA and FR, using pêcher ‘to fish’ to illustrate
(Adapted from Beaulieu & Cichocki 2008).


Third-person plural variation      Varieties observed
a. Les pêcheurs pêchent            Both FA and FR
b. Les pêcheurs i pêchent          FA and most oral varieties of FR
c. Les pêcheurs i pêchont          FA only
d. Les pêcheurs pêchont





Found in the present, imperfect and present conditional tenses, the desinence ont in lieu of ent for third 
person plural is used more frequently with IR/RE verbs (such as i(l) disont) than with ER verbs, despite the
examples found in Table 1 (Beaulieu & Cichocki 2005).³ The use of the ont flexion, simultaneously referred
to as ‘archaic’ and ‘typical’ in AF, also has some variation in the pronoun associated with it. The s marking 
the plural in the pronoun ils is frequently absent from speech (Flikeid & Péronnet 1989), resulting in the 


³ The desinence iont is also observed in the present subjunctive (Flikeid & Péronnet 1989).




pronoun being reduced to i (b) and thus eliminating liaison if followed by a vowel. The post-verbal ont is
actually said to exist only in conjunction with the pre-verbal i, as we see in (c), making (d) not a possible
or observed option (Beaulieu & Balcom 1998). In NENB, i-V-ont is openly (and institutionally) stigmatized
in closed social and linguistic networks yet concurrently favored through its use by mainly middle-aged and
older male speakers as a sign of in-group solidarity (Beaulieu & Cichocki 2008; King, Nadasdi, & Butler
2004; Beaulieu & Balcom 1998; Flikeid & Péronnet 1989). Therefore, in sum, the desinence ont is controlled
by its social value, governed in NENB by 1) a closed network and 2) the desire to optimize verbal opposition.
FASENB, on the other hand, is less marked by standardization than FANENB; the region is 1) conservative
in that it retains many traditional FA features but is also 2) innovative because FASENB is characterized by
more new linguistic features than is FANENB (Péronnet & Kasparian 1998). In fact, FASENB has a rather
healthy use of ont: its proportion of use by the older generation (+60 years) was found to be 0.70 by Flikeid
& Péronnet (1989).
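
The four patterns in Table 1, together with the restriction that post-verbal ont occurs only with pre-verbal i, can be restated as a small decision procedure. The Python sketch below is purely illustrative and is not part of the method of this study: the function name `classify_3pl`, the set `PREVERBAL_I`, and the assumption that the verb is the last word of the clause with any pre-verbal i immediately before it are hypothetical simplifications. Verbs whose standard third-person plural already ends in -ont (sont, ont, font, vont) are deliberately left out of scope.

```python
# Illustrative sketch only: labels follow rows (a)-(d) of Table 1.
# Assumptions: the clause has a lexical subject, the verb is the final word,
# and a pre-verbal i (if present) is the word immediately before the verb.
PREVERBAL_I = {"i", "i'", "i’"}


def classify_3pl(clause: str) -> str:
    """Return 'a', 'b', 'c' or 'd' for a third-person plural clause."""
    words = clause.lower().split()
    if len(words) < 2:
        raise ValueError("expected at least a subject and a verb")
    verb = words[-1].strip(".,!?")
    has_preverbal_i = words[-2] in PREVERBAL_I
    has_ont = verb.endswith("ont")
    if has_ont:
        # (c) is the FA-only pattern; (d) is reported as not observed.
        return "c" if has_preverbal_i else "d"
    return "b" if has_preverbal_i else "a"


if __name__ == "__main__":
    for example in (
        "Les pêcheurs pêchent",
        "Les pêcheurs i pêchent",
        "Les pêcheurs i pêchont",
        "Les pêcheurs pêchont",
    ):
        print(example, "->", classify_3pl(example))
```

A clause labelled 'd' by such a procedure would count as a candidate counterexample to the claim that ont requires pre-verbal i, which is why the sketch keeps it as a separate outcome rather than folding it into 'c'.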


3.2. DESINENCE IN LE PIQUE-NIQUE. In the following short sections, the different French-speaking
characters of Le pique-nique are analyzed to see whether the two vernacular Acadian flexions are present
in their speech. Considering that Rossignol attempted to write the characters as true to their regions as he
could and based on the research summarized above, instances of i(l) ... ont are likely to be found in the lines
of both the NE and SE Chorus members but occurrences of je ... ons are doubtful. Also, since there has been
no documentation of these two flexions in NWNB, which is not considered Acadian (Keppie 2013), it is
improbable that either the NW Chorus member or Paul would use either flexion. As for the Bishop
(Évêque) and Malo, vernacular flexions are not expected: the former represents the educated social class
which is not characterized by traditional, archaic AF features and the latter resides in NWNB.


3.2.1. LE CHŒUR (NE-SE). Throughout the play, the North-East and South-East Chorus members speak
in what has been shown to be traditional, vernacular AF. In Le pique-nique, we find evidence of not only a
categorical use of the pre-verbal i but several expected cases of the flexion ont and two cases of je ... ons,
which were not anticipated. In the example below, spoken by both Eastern Chorus members, we see the
flexion ont used in the third-person plural conjugations of connaître ‘to know’ and avoir ‘to have’, which
reflect the variations only found in FA, as described in Table 1(c):


Worry pas ton tchoeur, Malobianah. Les Brayons i’ sont toutes fuckés. I’ connaissont pas leus
histoire zeux pace qu’i’ n’avont pas d’histoire! Pis ceux qui en avont un petit brin, well, i’ la
comprennont pas. I’ sont toutes boloxés entre les Anglais, les Irlandais, les Écossais, les Américains, les
Français, les Québécois, pis les Indiens. I’ avont pas le sens de la race (Rossignol 2001: 23).⁴


The two instances of je ... ons illustrated below were not expected, because previous studies found this flexion
to have all but disappeared from FANB varieties due to stigma and institutional support. They appear in
different conversations within Le pique-nique. In the first, the Chorus members are attempting to one-up
each other by stating what makes their region special and the South-Eastern member, using j’avions, proudly
states that they are the home to Magnetic Hill, a gravity hill optical illusion very popular with tourists. In
the second example (j’serions), both Eastern members gang up on Paul as they hear his dialect, declare him


⁴ Translation: ‘Don’t worry your little heart, Malobianah. The Brayon are all fucked up. They don’t know their own history because
they don’t have any history! And for those who have a tad, well, they don’t understand it. They’re all mixed up between the English, the
Irish, the Scottish, the Americans, the French, the Quebecois, and the Indians. They don’t have any sense of race.’




not one of their own, and mock him for his sexual orientation:⁵


Nous autres, j’avions la Magnetic Hill.
‘Us, we have the Magnetic Hill.’


Quoisséqu’a voulait encore yelle. Une vraie Québécoise! Pire! Une Française! Pis d’un
autre côté, j’serions point surpris qu’c’arriverait straight des States, ça.
‘What does she want this time? A true Quebecois! Worse! A Frenchwoman!
On the other hand, we wouldn’t be that surprised if that came straight from
the States!’


These occurrences of je ... ons in Le pique-nique indicate that while the flexion may still retain a certain
degree of stigma and is used unconsciously, it probably still exists in FANB, in particular in FASENB, as
both cases of je ... ons in the script were found when the South-Eastern Chorus member is speaking. The
flexion may not be as prevalent as in other FA varieties, but to say that it has disappeared from FANB 
varieties may not be entirely accurate just yet. One cannot make this claim from sociolinguistic research, 
yet artistic cultural reflection should not be discredited. 


3.2.2. LE CHŒUR (NW). There are no observed cases of either flexion in the speech of the North-West
Chorus member. Nowhere does the Chorus use these flexional endings when speaking together as a group.
When NW is part of the Chorus, neither i(l) ... ont nor je ... ons are present.


3.2.3. ÉVÊQUE. As expected, the Bishop, the one character in Le pique-nique representing the educated
social class, does not partake in any traditional, vernacular flexional endings. The language used in his lines
is representative of Standard (Acadian) French, as illustrated by the use of ent for the third-person
plural flexion, reflecting (a) in Table 1: “Le Seigneur m’appelle, enfin, les sœurs m’appellent.” (Rossignol
2001:70).⁶ What is interesting to note, however, is that the Bishop sticks to the most formal (written) variety
and does not even use a pre-verbal flexion after les sœurs to reflect popular FA and most FR varieties, such
as les pêcheurs i pêchent in Table 1(b).


3.2.4. PAUL. This Brayon character, who was written to reflect Rossignol himself, is only shown to use the
pre-verbal i as any kind of vernacular flexion, e.g. in “sauf que moé j’sais pu pantoute si que chu tune
Acadienne, une Brayonne, une Québécoise, une Française ou un mouton, parce qui disent que chu pas comme
les autres” (Rossignol 2001:43), where Paul attempts to explain to the Chorus that everybody is entitled to
their own identity and that categorization can lead to communication challenges.⁷


3.2.5. MALO. As expected, Malo’s speech shows no instance of either il... ont or je... ons. However, her
third-person plural pre-verbals are all limited to the vernacular, yet very common, i, such as “Oui, mais sont-i
Canadiens?” and “I m’ont dit que les Anglais sont pas des Acadiens” when in discussions with le Choeur about
the meaning of being Acadian (Rossignol 2001:23, 33). Therefore, the desinence used by Malo is more
phonetic than morphosyntactic.


⁵ (Rossignol 2001:18, 31)
⁶ Translation: ‘The Lord is calling me, actually, the nuns are calling me.’
⁷ Translation: ‘Except that me, I don’t know at all if I’m an Acadian, a Brayon, a Quebecois, a French or a sheep because they say
that I am not like the others.’




3.2.6. CULTURAL AND LINGUISTIC UNITY BETWEEN THE CHARACTERS. In general, each
character is linguistically representative of their region and yet they are all united through certain vernacular
morphosyntactic features, save for the Bishop. There are three such features. Firstly, the third-person
pre-verbal i is found throughout the entire play; there are no instances of either the standard ils or the
vernacular il. Secondly, nous is never used as a first-person plural pre-verbal pronoun; only on is. These two
features, however, are so widely recognized throughout the francophone world as common oral features
that they are almost no longer considered vernacular. The third shared feature, conversely, is not
globally recognized and is shared by the Chorus and Malo, but not Paul. The second-person plural forms of the
irregular -RE verbs faire ‘to do’ and dire ‘to say/tell’ in the present tense are conjugated with the ending -ez
consistently throughout Le pique-nique rather than the standard flexion -tes. This is illustrated in the
following three examples. In the first, Malo struggles to understand the geographical and genealogical
criteria for being Acadian; in the second, the Chorus berates Malo and Paul for not speaking as they would
prefer; and in the third, the Eastern Chorus members show their annoyance at the Brayon’s cultural pride:⁸


Malo: Ben vous avez dit que c’était en Acadie pis lui i’ dit qui’s en va chez Victoria pis vous disez quii’s en
va-t-en Angleterre pis vous disez que les Anglais sont pas des Acadiens, mais les Brayons i ont une étoile
pour les Anglais su leur drapeau pis vous disez qui sont des Acadiens. C’tu-ça?

Malo: ‘Well, you said that it’s in Acadia and him, he says that he’s going to Victoria and you say that
he’s going to England and you say that the English aren’t Acadians, but the Brayon, they have a star
for the English on their flag and you say that they are Acadians. Is that right?’


Choeur: Silence tapette et sauvagesse! Vous nous faisez honte! Apprenez a parler. Apprenez! 
Apprenez! 
Chorus: ‘Silence faggot and savage! You embarrass us! [...] Learn to speak. Learn! Learn!’ 


N-E et S-E: Bon, la colonne du Madawaska qui grimpe dans le buckwheat encore! Ah! Les Brayons, j'vous
dis que vous faisez chier l’Acadie!

N-E and S-E Chorus: ‘Well, the idiot from Madawaska is climbing in the buckwheat again. Ah, you 
Brayon! I’m telling you that you piss off Acadia!’ 


While there is very little research, if any, on the use of this flexion in (Acadian) French varieties, it has
been shown to occur in Acadian folk songs and poetry, such as in “Le Frolic à houker” by twentieth-century
folk songwriter Leah Maddix from Prince Edward Island. In the following excerpt from “Le Frolic à houker”,
we see that Maddix uses the traditional form faisez not only in the present (vous faisez) but also in the
imperative, giving rise to the question of whether other irregular verbs follow a similar flexion in the
present and imperative:⁹


J'ai-t-une conseil à vous donner
Si vous faisez des frolics à houker,
Faisez-les pas l'après-souper


I have a piece of advice for you, if you have hooking parties, don’t have them after supper. 


⁸ Rossignol (2001:22, 49, 53)
⁹ De première main, Prince Edward Island government website.




CONCLUSION. Going back to the playwright’s explanation of his writing process, I believe in fact that 
Rossignol understated the importance of regional accents playing “a particular role” in his literary work. 
Dialect representation may not have been a primary objective of writing Le pique-nique, yet it greatly
enhances the accuracy he achieved in portraying the many facets of how Acadian identity is represented 
within New Brunswick alone and the idiosyncrasies of the Acadian/New Brunswick French varieties. I 
would argue that linguistic representation in literary works plays in fact a great role in cultural studies in that 
it can help support, confirm, and complement sociolinguistic research, increasing the dialogue around the 
importance of supporting the arts and their contribution to communities of minorities. Yet, the path taken 
in Acadian Studies by scholars reflects very much the problem faced in Acadia today. In Acadia, (economic) 
partnerships throughout the Acadian diaspora need to be fostered in order to realize the full potential of 
Acadia as a globalized community and indeed maintain it—the objective of the 2014 World Acadian 
Congress (Keppie 2014). In Acadian Studies, research could be strengthened and unified by combining the 
(literary) arts with the social sciences, thus contributing to the discussion of Acadia as a globalized 
community. In this particular case, Le pique-nique is a perfect marriage between previous linguistic analysis
of Acadian/New Brunswick French (for example, Beaulieu & Cichocki 2005 and 2008, Flikeid & Péronnet
1989) and recent work on the relationship between territory, region, and identity (Keppie 2011, 2013a, and
2017, and Lefebvre 2010a and 2010b).


REFERENCES 


BEAULIEU, LOUISE, & PATRICIA BALCOM. 1998. Le statut des pronoms personnels sujets en français
acadien du nord-est du Nouveau-Brunswick. Linguistica Atlantica 20: 1-27.

BEAULIEU, LOUISE, & WLADYSLAW CICHOCKI. 2005. Innovation et maintien dans une communauté 
linguistique du nord-est du Nouveau-Brunswick. Francophonies d’Amérique 19: 155-75. 

BEAULIEU, LOUISE, & WLADYSLAW CICHOCKI. 2008. La flexion postverbale -ont en français acadien:
une analyse sociolinguistique. The Canadian Journal of Linguistics 53.1: 35-62.

De première main, Prince Edward Island government project:
http://www.gov.pe.ca/firsthand/index.php3?¢number=43682&langF 

FLIKEID, KARIN, & LOUISE PERONNET. 1989. N’est-ce pas vrai qu’il faut dire ‘J’avons été’ ?: Divergences
régionales en acadien. Français Moderne 57.3/4: 219-28.

HELLER, MONICA, & NORMAND LABRIE. 2003. Langue, pouvoir et identité: une étude de cas, une
approche théorique, une méthodologie. In Discours et identités, la francité canadienne entre modernité et
mondialisation, ed. by M. Heller & N. Labrie, 9-40. Cortil-Wodon: E.M.E.

KEPPIE, C. 2011. Understanding the meaning of Acadie. Journal of Canadian Studies 45.1: 200-27. 

---. 2013a. Meaning Systems of Two Identity Concepts: Acadie versus Acadien. American Review of
Canadian Studies 43.3: 315-33.

---. 2013b. L’Acadie communautaire: Today's New Brunswick Speech Community. International Journal
of Community Diversity 12.3: 55-63.

---. 2014. The World Acadian Congress 2014 ... Now What? Paper presented at the annual conference of
the Association for Canadian Studies, Charlottetown, Prince Edward Island, November 2014.

---. 2017. Le Choeur of New Brunswick Acadia — Regional Dynamic of Group membership in Le pique-nique.
LACUS Forum 40: 351-80.

KING, RUTH, TERRY NADASDI & GARY R. BUTLER. 2004. First-person plural in Prince Edward Island
Acadian French: The fate of the vernacular variant je ... ons. Language Variation and Change 16:
237-55.

LEBLANC, MATTHIEU. 2009. « Bilinguals Only Need Apply ? » : Luttes et tensions dans un lieu de travail
bilingue en Acadie du Nouveau-Brunswick. Francophonies d'Amérique 27: 77-103.

LEFEBVRE, MARIE. 2010a. « Moncton: identité bilingue et capital linguistique ». In Territoires
francophones: études géographiques sur la vitalité des communautés francophones du Canada, ed. by
Anne Gilbert, 191-208. Québec: Septentrion.

---. 2010b. « Identité et territoire dans l'Acadie contemporaine ». In Territoires francophones: études
géographiques sur la vitalité des communautés francophones du Canada, ed. by Anne Gilbert, 308-35.
Québec: Septentrion.

LONERGAN, DAVID. 2013. Acadie 72: La naissance de la modernité acadienne. Sudbury: Prise de Parole.

PERONNET, LOUISE, & SYLVIA KASPARIAN. 1998. Vers une description du « français acadien »: Analyse
des traits morphosyntaxiques. In Français d’Amérique: variation, créolisation, normalisation, ed. by
Patrice Brasseur, 249-60. Avignon: Centre d'études canadiennes.

ROSSIGNOL, RINO MORIN. 2014. Written interview.

---. 2001 (1982). Le pique-nique. Moncton: Éditions Perce-Neige.

THERIAULT, JOSEPH-YVON. 2007. Faire société: société civile et espaces francophones. Sudbury: Prise de
Parole.


This article was first published at lacus.weebly.com. 


LACUS Forum 41:1.52-60 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 





THE RHETORICAL FUNCTIONS OF THE INDEFINITE ARTICLE: 
A COGNITIVE LINGUISTIC APPROACH 
ADRIAN LOU 
University of British Columbia 


Abstract: This paper challenges the common notion that the indefinite article marks only unfamiliar and not 
readily identifiable content. Using a cognitive linguistic approach, I argue that indefinite articles, which lack 
a stable reference point, can be used to evoke and address a variety of different entities. When constructed in 
a very specific way, the indefinite can shift between the speaker’s and audience’s viewpoints, resulting in a 
rhetorical technique that allows the speaker to establish a connection with the audience. My data, which come 
from political speeches, show how politicians rhetorically mobilize the indefinite article to redirect the focus 
onto the audience by addressing the audience with the first-person plural pronoun after deploying multiple 
indefinite NPs. Ultimately, the paper suggests a possibility that the indefinite article has similar pragmatic 
and cognitive functions as the definite article, despite contrasting with it grammatically. 


Keywords: indefinite article, cognitive linguistics, viewpoint, rhetoric, political discourse 


Language: English 


TRADITIONAL THEORIES IN LINGUISTICS tend to strongly distinguish the functions of the indefinite 
article and the definite article (Abbott 2006:1). Generally, the indefinite article a/an/∅ is said to mark
unidentifiable referents, typically introducing “new referents into the discourse” (Givón 1993:203).
Indefinite articles signify the presence of new information, a referent presented in speech that has not yet 
been introduced. The speaker, “[assuming] that the hearer cannot identify the referent” will, in this type of 
situation, use the indefinite article (Brinton & Brinton 2010:328). The definite article the, on the other hand, 
can sometimes behave like a demonstrative (this, that, these, those) and denote given, or previously 
identified, information. Although there is nothing incorrect about these assumptions, Mira Ariel and 
Richard Epstein have criticized traditional explanations of how the definite article works, claiming that 
these definitions are destabilized in certain discursive situations. Epstein writes, “most (if not all) studies of 
the definite article in English share the same guiding intuition that definiteness is essentially just a matter 
of reference” (Epstein 2002:334). Epstein cites a number of scholars who have maintained that no approach 
has been satisfactory in explaining all the uses of the definite (Epstein 2002:344). 

Research on the indefinite article has not had the same scholarly treatment as the definite article. One 
reason for this might be that the indefinite article is often characterized as merely the grammatical
antithesis of the definite. However, I contend that the indefinite article can do much more than indicate new 
information or unidentifiable referents. In fact, Epstein’s cognitive conceptualization of the definite article 
can be used to understand the indefinite article as well. Using Fauconnier’s mental space theory, Epstein 
uncovers the cognitive process by which the definite article can be employed to identify referents that are
neither familiar nor identifiable (Epstein 2002:333). The present study extends Epstein’s theoretical
work by examining an analogous anomaly, discursive instances where the indefinite article is used to refer 
to referents and subject matter that are both familiar and identifiable. 

This paper investigates these idiosyncratic uses of the indefinite article by utilizing the framework and 
concepts that Ariel and Epstein have put forward: the theory of Accessibility (Ariel 1988) and Epstein’s 
cognitive linguistic approach to the definite article. I argue that the indefinite article, despite lacking a stable 
reference point, can be used to evoke and address a variety of different entities. When constructed in a very 
specific way, the indefinite can shift between the speaker’s and audience’s viewpoints. Using data from 
political speeches, I demonstrate how this rhetorical indefinite article is used by politicians as a way to 
generalize specific and identifiable events, enabling the speaker to conceptually subsume the entirety of the 
audience, which can then be either praised or censured. Ultimately, the paper suggests a possibility that the 
indefinite article has similar pragmatic and cognitive functions as the definite article, despite contrasting 
with it grammatically. 


1. COGNITIVE FRAMEWORK. In the same vein as Epstein, the empirical evidence I have compiled 
consists of real-life examples, all of which are drawn from American political speeches. My evidence
explicitly shows instances in which the indefinite article is followed by identifiable information. This
approach, to a certain extent, fittingly complements Epstein’s: his paper looks at how the definite article can
defy the notion that the is restricted to identifiable or given information; my research, conversely,
looks at how the indefinite article does not have to mark unidentifiable or new information. Epstein’s article,
using Fauconnier’s Mental Space theory as a foundation, posits three alternative functions for the definite. 
These include discourse prominence, role/value status, and viewpoint shifting. In this section, I will 
summarize these relevant theories, which will be synthesized to form my own methodology. 


1.1. MENTAL SPACE AND DISCOURSE PROMINENCE. Discourse prominence is the presentation of 
new information with a definite article as a way to highlight its eventual topicality. For instance, we can 
begin a conversation with a question like this: “Did you hear about the fight?” Although “the fight” is new 
information, it is presented as an identifiable referent. Epstein argues that this type of statement, when 
uttered, induces the listeners to cognitively store the information in what Fauconnier calls a mental space. 
Mental spaces can be later accessed by a cognitive access path, which connects the discursive material to the 
mental spaces already formed. This example of the mental space with the definite article informs the listeners 
that the NP will become a prominent topic. The access path enables the listener to access this information 
again when the speaker refers to it later in the discourse. 


1.2. ROLE/VALUE STATUS. Secondly, Epstein explains that the definite article can indicate an entity’s role
or value function. Role/Value status is intricately connected to Ariel’s theory of Accessibility (Ariel 1988). In
refining the definition of definite articles to account for cases that traditional theories cannot, Ariel
theorizes an Accessibility hierarchy, which places certain words onto a stratified spectrum that ranges from
high to low. For Ariel, words with high Accessibility indicate that the receiver in a discursive situation has
sufficient knowledge about who or what the speaker is referring to. For example, pronouns typically have
high Accessibility. When using a pronoun (the anaphor), the speaker, in felicitous conditions, assumes
that listeners can easily access the antecedent. A low Accessibility marker, on the other hand, signifies the
absence of identifiable content that has previously been made known. Definite descriptions and proper 
nouns often indicate low Accessibility, because they are normally used in situations where listeners might 
be unable to locate the referent. This might be due to the fact that the antecedent is too far back, the referent 
is not a salient topic, or the NP is new information. In such cases, the speaker “must supply the addressees 
with the richest possible context” (Ariel 1988:69) and proper nouns help pinpoint the entity being discussed. 




A very basic example to demonstrate the difference between role and value would be the NP the 
President of the United States. If used in 2014, the president would refer to Barack Obama. If used in 2004, 
it would be George W. Bush. The term president, therefore, would be the role and the actual person it refers
to is the value. From an Accessibility perspective, roles are markers of high Accessibility, because if the role
(the president) is presented instead of the value (Barack Obama), the addresser believes the addressee 
possesses enough contextual knowledge to know who or what is being talked about. 

As a result, the definite article, according to Epstein, can shift between role and value. He provides a 
discursive instance of this role/value shift in this example where a disgruntled voter exclaims: “I’m the guy
who is footing the bill” (Epstein 2002:359). The presence of the definite article might initially suggest that 
the guy in the utterance is a unique referent that has value status. However, such value interpretation would 
mean that the guy is literally the only one footing the bill. For Epstein, what is more likely is that the guy 
conceptually embodies the collective population of taxpayers and not an individual taxpayer. 


1.3. VIEWPOINT SHIFTING. Furthermore, the way articles can signify new and given information 
suggests that substantial viewpoint activity can occur, particularly in regard to how perspectives are 
accommodated. One of the literary reasons for using given information and definite references to begin a 
text is to generate an in medias res effect (Short 1996:266). The definite references have the ability to situate 
the audience into the midst of the story’s narrative events, which places the audience in the know. Take this 
example of in medias res from the first lines of Margaret Atwood’s The Handmaid's Tale: “We slept in what 
had once been the gymnasium. The floor was of varnished wood, with stripes and circles painted on it, for 
the games that were formerly played there” (Atwood 1985:1). Commencing narration with definite articles 
not only demands the formation of multiple mental spaces, but it also situates the readers into the story, 
inducing them to take up the perspective of the narrator. 

As Epstein explains, the definite article, in a situation like this, pushes the readers into the viewpoint of 
the narrator because the referents “are identifiable only to the narrator of the story, not the readers” (Epstein 
2002:363). Additionally, abrupt changes in article use can also indicate a shift to the discourse protagonist’s 
point of view. Again, I will attempt to simplify a complex example from Epstein’s data. This one is a news 
report, and it reads, “Andy Dotson ran out of his burning house. However, he had forgotten his blanket 
with the distinctive penguin design.” Here, the signals a subtle shift from the reporter’s viewpoint to the
discourse protagonist’s viewpoint. Epstein maintains that the indefinite article, had it been used in its place, 
would make the NP a subjective judgment of the news reporter. However, the definite article here shifts the 
viewpoint away and implies that Dotson, the discourse protagonist, and not the news reporter, is the one 
who thinks the blanket has a distinctive design. Epstein’s conclusions about the definite article will form the 
basis of my argument for the indefinite. For my project, I propose that all these functions of the definite 
article are transferable and can be used to explain how the indefinite article operates as well. 


2. POLITICAL DISCOURSE EXAMPLE. The speech that I will draw heavily from is then-President-elect
Barack Obama’s victory speech, given the night of his election victory in 2008. The most prominent and 
most frequent use of the indefinite article comes near the end of his speech where he discusses the life of a 
106-year-old African American woman named Anne Nixon Cooper. Obama rhetorically mobilizes the 
story and life of Cooper as a way to navigate through a century of American history. Example (1) is a portion 
of the Cooper narrative: 


(1) A man touched down on the moon, a wall came down in Berlin, a world was connected by our own
science and imagination. Yes, we can! 




Here we might notice that the historical events being evoked, though new discourse referents, are not 
exactly unidentifiable. Clark & Marshall’s pragmatic theory on referencing proposes that the way we 
reference things can be explained by three different types of “mutual knowledge” (Clark & Marshall 
2002:450). The mutual knowledge Obama references here would be considered “community membership” 
knowledge, a type of knowledge “all community members are assumed to possess” (Ariel 1988:66).¹ Given
that (1) contains unambiguous allusions to iconic events, these indefinite NPs are obviously not indicators
of unidentifiable entities. Clearly, Obama is referring to events that are known by the audience. But why
would a speaker reconfigure community knowledge into an NP that grammatically implies it is unfamiliar
information? Rephrasing (1) with definite descriptions would produce something that is also
comprehensible and presumably more appropriate. The accessibility of definite descriptions
eliminates any possible taxing mental processing of content. The following revision of (1) is an example of 
what a definite rephrasing might look like. 


(2) Neil Armstrong touched down on the moon, the Berlin Wall came down, our world was connected by 
our own science and imagination. 


While this rephrasing of Obama’s speech revises the original example to clarify these nominal entities, 
(2) might not be much easier to make sense of than (1) for most audiences. This precise example complicates 
conventional understanding of the indefinite article as merely a marker of new referents and/or
unidentifiable NPs. Traditional theories would most likely assert that (2) is a more grammatical 
construction than (1), and that (1)’s reconfiguration of community knowledge into indefinite NPs subverts 
the felicity of the speaker. Of course, viewing Obama as a noncompliant speaker in such an important speech 
would be equally faulty. Something else is happening here, and re-appropriating Epstein’s methodology 
might help make sense of this anomaly. 


3. ANALYSIS. 


3.1. NARRATIVE MENTAL SPACE AND VIEWPOINT SHIFTING. This peculiar use of the indefinite 
article, which for all intents and purposes is actually definite and identifiable, should not be seen purely as 
a grammatical irregularity but as a rhetorical strategy that produces two distinctly different effects. On the 
surface, these lines, which contain highly evocative frames and images, make up the biographical and 
historical narrative Obama wants to tell. However, by employing the indefinite article, Obama conceptually 
generalizes the historical significance of these events, reshaping the narrative into a collectively vicarious 
one that all Americans have experienced. This latter effect is achieved through the indefinite article's ability 
to shift the viewpoint away from the discourse narrator to the audience. Because the indefinite is normally 
used in situations where the audience is not aware of the referent, the indefinite article prioritizes their
epistemological perspective over that of the speaker or the discourse protagonist, paving a way for them to be a part
of the discourse. In other words, the indefinite reverses the in medias res effect of the definite article. The 
indefinite article’s creation of this access path challenges Epstein’s contention that “[the] indefinite article 
does not trigger the construction of links to other spaces because it is a signal that the entity is not accessible” 
(Epstein 2002:374). However, in the following sections, I demonstrate how the indefinite article, when 
constructed in conjunction with contextual cues, can generate multiple access paths and mental spaces. 


¹ The other two types of knowledge are arguably easier to process. The second one is physical knowledge, which is the knowledge
of the situation they are in physically. The third is linguistic knowledge, such as the commentary presented in the
discourse.




I begin at the syntactic level of example (3), taken from Obama’s speech. The goal here is to show 
precisely how the lines in (3) are constructed and how the audience interprets the content. This part comes 
chronologically before (1) in Obama’s victory speech, where he discusses Cooper’s experience with the 
tumultuous events of the Civil Rights movement in the 1960s. However, the words Civil Rights movement
are never explicitly uttered. 


(3) She was there for the buses in Montgomery, the hoses in Birmingham, a bridge in Selma, and a 
preacher from Atlanta who told a people that We Shall Overcome. Yes we can! 


The multiple NPs of (3) provide metonymic cues, in the form of proper nouns, which signify to the audience 
that the events are from the Civil Rights movement of the 1960s. This is done through the evocation of 
historical locations where many momentous civil rights demonstrations took place. Despite the indefinite 
article indirectly suggesting the NPs as not readily identifiable, these noun phrases still congeal together to 
form and refer to a broader mental space or a collective consciousness that contains knowledge of the Civil
Rights movement. If the indefinite article is pointing toward community knowledge the same way the 
definite would, then what makes the indefinite different from the definite? Why does Obama perform this 
rhetorical detour? To highlight the difference, let’s once again rephrase this section of the speech by 
replacing the indefinite articles with more accessible NPs. Example (4) substitutes the indefinite NPs of the 
original text with more culturally and socially identifiable proper nouns. 


(4) She was there for the Montgomery Bus Boycott, the Birmingham Campaign, the Selma to 
Montgomery Marches, and Martin Luther King, who told the people that We Shall Overcome. Yes
we can!


It is not enough to say that Obama is just poetically alluding to the Civil Rights movement to say that 
the seemingly unhealable racial wounds and divide can be remedied and overcome when we work together, 
or else example (4) would sound as rhetorically effective as (3). On the contrary, (4), like the other revision,
exhibits a dissonance that, I argue, stems from the sudden pronoun shift from she to we. The first-person
plural we is a part of Obama’s campaign slogan, “Yes we can!”, a line that he repeats throughout the speech.
The we is an instance where Obama overtly shifts the topicality and viewpoint from the story of Anne Nixon 
Cooper to the audience. Cooper’s story is an American story. The audience is embedded into this story of 
America’s triumphs and failures. 

The definite articles of (4) prohibit the creation of cognitive access paths, leaving only one singular 
mental space for the audience to conceive of. The rephrased sample’s sudden shift away from Cooper’s 
narrative to the audience's perspective is jarring; it does not work because the audience in (4) was not made 
a part of her story. They can only see it in medias res. The indefinite articles of (3), on the other hand, provide 
two mental spaces - one is Cooper’s biographical story and the other is a generalized narrative of a generic 
character functioning as an embodiment of all Americans. (3) begins with “She was there,” a mental space 
builder that informs the audience that the following content would be a representation of Cooper's life. The 
generalized narrative is induced by the indefinite article, which reconfigures the moments of Cooper’s 
biography into conceptual historical events. Without the indefinite article's foregrounding of the audience 
viewpoint, (4) limits and restricts the Narrative Mental Space exclusively to Cooper's perspective. This string 
of indefinite articles forges an access path that links the two spaces together, enabling the audience to have 
a double interpretation of the lines. That is, the audience can understand (3) as simply Anne Nixon Cooper's
story or as a more generalized narrative of being an American. 




3.2. ROLE AND VALUE INTERPRETATION. To better address this contentious issue over the indefinite’s
ability to access multiple spaces, I unpack the role and value status of Obama’s indefinite constructions. I 
assert that the use of “a bridge in Selma,” for example, activates an access path that prompts both a value 
and a role interpretation. The conversion of the events into roles destabilizes the event’s fixed property; “a 
bridge in Selma,” as a role, is not just a demonstration in 1965. Rather, it can be construed as a significant 
historical milestone that not only expanded African Americans’ right to vote, but also unveiled the brutal 
and violent tactics used to suppress African Americans from protesting. The application of the indefinite 
generalizes the historical and narrative elements into roles, again allowing two interpretations of the lines. 
We can see Cooper as physically being there as well as spiritually being there when all of these events 
unfolded. Consequently, the role status of “the moon landing” and the “fall of the Berlin Wall” are then 
instances of scientific and democratic progress. 

Another way to understand the indefinite’s facilitation of viewpoint shifting is to consider the 
indefinite’s low level of accessibility. When an NP ranks low on the Accessibility hierarchy, it is easier for 
the audience to figure out the possible referents. By strategically utilizing the indefinite NPs, which, 
according to Givon and Ariel, have the lowest level of Accessibility (Ariel 1988:84), the speaker prolongs 
his engagement with the audience and preserves their viewpoint as the main discursive perspective. As an 
example, let us compare these two statements: The indefinite article: “Imagine you are on a beach in Hawaii” 
and the definite counterpart: “Imagine you are on the beach in Hawaii.” The latter example with the definite 
article assumes the audience has shared knowledge about a particular beach. Directing the listeners to 
community knowledge that might not exist forcibly disrupts cognitive visualization of the location. In 
contrast, the example with the indefinite article continues to host the audience’s viewpoint. It does not 
presume that the audience has access to shared knowledge about a specific beach in Hawaii. 

In marking the locations, events and objects as indefinite ones, Obama rhetorically accommodates the 
audience’s cognitive viewpoint. The use of the indefinite description of identifiable content gives the entity 
an amorphous shape and compels the audience to understand these events in relation to themselves. It is 
important to remember that these lines are also not merely symbolic representations of history. Rather, 
these events can be interpreted as events that the audience, as a collective whole, has experienced and 
overcome. By setting up two possibilities to construe the NPs, Obama can smoothly transition to the 
audience’s viewpoint with his “Yes we can!” catchphrase. 

I intentionally avoided mentioning the presence of the definite article in (3) earlier for the sake of 
cohesion. Though our focus is on the indefinite, the use of the definite in (3) actually helps validate my 
claim that both the indefinite and definite articles can function similarly, something that traditional theories 
would reject. We can note how the definite article in (3) is followed by a noun that is pluralized: “the hoses in 
Birmingham”. The plural, in this situation, disperses the referent and its definiteness in order to generalize 
the event; and so, the definite does not demarcate a specific collection of hoses or buses. Furthermore, the 
alternating use of the definite and indefinite article eliminates the possibility that these articles are 
functioning as traditional markers of new or given information or markers of identifiability. The traditional 
perspective would fail to explain this highly irregular instance in which the speaker, Obama, shuttles 
between definite and indefinite articles in a matter of four phrases.* Therefore, the articles cannot be 
indicating levels of identifiability. They must be, instead, performing a rhetorical function. 


4. RHETORICAL FUNCTION. The indefinite article’s shifting of viewpoints can be seen as a rhetorical 
technique that supplies the speaker a way to praise or censure the audience, which would align with the 


* Another reason why this is not a typical the is that starting with the definite article would suggest the topic will be referred to 
later. Epstein calls this discourse prominence. None of the definite NPs are talked about again. 




Aristotelian notion of epideictic rhetoric. Although it is rare for politicians to vilify the audience, the next 
example shows how the indefinite article rhetorically collectivizes the audience for the purposes of 
condemning their inactions. The following comes from President Obama’s reaction to the Sandy Hook 
Elementary School Shooting in Newtown, Connecticut in 2012. 


(5) As a country, we have been through this too many times. Whether it’s an elementary school in 
Newtown, or a shopping mall in Oregon, or a temple in Wisconsin, or a movie theater in Aurora, or 
a street corner in Chicago -- these neighborhoods are our neighborhoods, and these children are our 
children. And we're going to have to come together and take meaningful action to prevent more 
tragedies like this, regardless of the politics. 


Like the victory speech, the indefinite articles in (5) are followed by more highly accessible information 
in the form of PPs and frame metonymies, which once again defy standard or typical indefinite article use. 
The salient topic is the Newtown shooting, and the fact that it happened in an elementary school has already 
been made apparent in the previous part of the speech. Yet, Obama makes the Sandy Hook Shooting into a 
grammatically indefinite referent: “an elementary school in Newtown.” Though it is possible to construe 
Obama's use of indefinite articles as a way to rhetorically generalize these incidents of mass shootings as 
events that can happen in everyday America, the low Accessibility of the PP suggests otherwise. That is, all 
the indefinite referents refer to major shootings in recent memory; yet, they are presented as indefinite ones. 
To put it another way, Obama is, paradoxically, being implicitly explicit and indirectly direct. By aligning the 
Newtown Shooting with the other indefinite NPs of previous shootings, Obama produces two mental spaces 
and two ways of interpretation. One mental space conceptualizes these NPs as specific references (the 
value); the other mental space conceptualizes them as indefinite and general locations (the role). Near the 
end of (5), Obama subtly and swiftly shifts the viewpoint to the addressees’ perspective through the 
deployment of the first-person pronoun: “we're going to have to come together.” However, unlike the 
previous example, Obama condemns and censures the general American public, citing examples that the 
public’s indifference, negligence, and inaction have facilitated. This rhetorical shift, which moves discursively 
from shootings to a rhetorical petition for immediate action, is achieved through the indefinite article and 
the multiple construals of its resulting mental spaces. 


5. OTHER EXAMPLES. This type of indefinite article use is, of course, not uniquely Obama’s. Here, in a 
speech given by Robert F. Kennedy in South Africa in 1966, we see Kennedy constructing his anaphoric 
lines with an indefinite NP for similar epideictic reasons. 


(6) Yet many of the world's greatest movements, of thought and action, have flowed from the work of a 
single man. A young monk began the Protestant Reformation, a young general extended an empire 
from Macedonia to the borders of the earth, and a young woman reclaimed the territory of France. It 
was a young Italian explorer who discovered the New World, and the thirty-two-year-old Thomas 
Jefferson who proclaimed that all men are created equal [...] These men moved the world, and so can 
we all. 


Kennedy alludes to historical figures with the indefinite NP followed by a contextualizing VP and closes off 
this section with the use of the first-person plural pronoun to celebrate the prodigious potential of the 
young. The effect mirrors what we have already seen in Obama’s speeches. Like the other examples, 
Kennedy alters the perspective after enumerating historical events and people by redirecting his message to 
the audience. At the end of (6), he says everyone has the potential to be like those people. The indefinite and 




its access paths enable this transition from a value to a role interpretation of these NPs. What is interesting, 
though, is the Thomas Jefferson reference he makes. It projects more definiteness than the others. It is 
possible that Kennedy is aware that the predominately South African audience might not be able to figure 
out who Jefferson is if the phrase had been structured in a way that parallels the established format. Jefferson 
might not be accessible community knowledge to people outside of the United States. This sudden 
appearance of the definite description reinforces Epstein’s argument that the definite article, in this case: 
“the thirty-two-year-old Thomas Jefferson,” is indicative of the narrator’s viewpoint. In other words, this is 
Kennedy interjecting his voice into the discourse. His overt inclusion of Jefferson in this pantheon of heroic 
figures, then, shows how viewpoint is being changed and manipulated to accommodate both the addresser 
and the addressee. Earlier in the speech, Kennedy utilizes rhetorical anaphora of an indefinite NP to 
produce humor: 


(7) I come here this evening because of my deep interest and affection for a land settled by the Dutch in the 
mid-seventeenth century, then taken over by the British, and at last independent; a land in which the 
native inhabitants were at first subdued, but relations with whom remain a problem to this day; a land 
which defined itself on a hostile frontier; a land which has tamed rich natural resources through the 
energetic application of modern technology; a land which once the importer of slaves, and now must 
struggle to wipe out the last traces of that former bondage. I refer, of course, to the United States of 
America. 











Even after using a land once, Kennedy continues with this low Accessibility marker. Accessibility theory, of 
course, would argue that once the referent is made, a higher Accessibility marker would be more fitting, 
such as a pronoun or a pronominal. In the audio recording of this speech, we can hear a brief moment of 
silence from the audience after Kennedy reveals that he has been referring to the United States and not South 
Africa. Obviously, the crowd had thought that he was speaking of South Africa. They hesitate briefly until 
some laugh and then begin to applaud. Although this example is drastically different in terms of its 
rhetorical goals, many of the same functions I have discussed are utilized. As he describes a land, the 
audience is made to believe Kennedy is speaking about South Africa. The frames and metonymies that 
Kennedy evokes are also highly selective as they have strong salience for a South African community. For 
instance, the reference to Dutch settlers is probably more obscure to an American audience than to a South 
African. All of these frames and descriptions are stored in the audience’s initial mental space. Let’s call this 
the South Africa Space because the contextual information is compatible with the knowledge structure a 
typical South African will have about his/her home country. 

Once more, the audience viewpoint is crucial in this analysis, because rewording (7) with the definite 
article, the land, would not only be unusual but would also not create the same effect or meaning. The 
indefinite keeps the audience involved by sustaining the viewpoint focus on them and not on a noun that is 
referred to as the land and not by its name. Once the mental space is established, Kennedy introduces his 
true referent: the United States of America. The audience is caught off guard. The simplistic exposition 
about a land becomes an unanticipated punch line. In order to process this abrupt change in referent 
effectively, the listeners have to construct a South Africa Space. Thus, once Kennedy informs them that 
their original mental space and all of its related frame projections are wrong, the information is 
transferred to the appropriate space, the United States Space, via an access path that is made possible 
by the indefinite NP. 


6. CONCLUSION. Throughout this paper, I have examined a variety of idiosyncratic uses of the indefinite 
article that traditional theories have not accounted for. I argue that the indefinite article is more than a 




marker of new information or unidentifiable referents. Using a cognitive linguistic approach, I have 
demonstrated that the grammatical nature of the indefinite article facilitates the accommodation of the 
audience’s viewpoint by prioritizing it over the speaker’s. The examples illustrate how the indefinite article 
produces cognitive access paths that enable two interpretations of the discursive material, one role and one 
value. Through that, more explicit viewpoint markers like we can be used, granting the speaker a way to 
interact with the audience rhetorically. There are many examples that I left out simply because I could not 
cover all of them. Further research should continue to examine how the indefinite article works in 
conjunction with Accessibility theory and mental space theory, which, practically speaking, offer linguists 
an effective way to tease out the subtle minutiae that many other theories have neglected. Moreover, future 
research should take an in-depth look at other genres that might use the indefinite article in this way and 
investigate other possible rhetorical effects of the indefinite article as well. 


ACKNOWLEDGEMENTS. I want to thank Jessica de Villiers for encouraging me to present at the LACUS 
Forum XLI and Barbara Dancygier for providing me tremendous help and guidance since the inception of 
this project. 


REFERENCES 


ABBOTT, BARBARA. 2006. Definite and indefinite. Encyclopedia of language and linguistics (2nd ed.). 
Boston: Elsevier. 

ATWOOD, MARGARET. 1986. The handmaid’s tale. Boston: Houghton Mifflin. 

ARIEL, MIRA. 1988. Referring and accessibility. Journal of linguistics 24.1:65-87. 

BRINTON, LAUREL J. & DONNA BRINTON. 2010. The linguistic structure of modern English. Amsterdam: 
John Benjamins. 

CLARK, HERBERT H., & CATHERINE R. MARSHALL. 2002. Definite reference and mutual knowledge. 
Psycholinguistics critical concepts in psychology: 450. 

EPSTEIN, RICHARD. 2001. The definite article, accessibility, and the construction of discourse referents. 
Cognitive linguistics 12.4:333-78. 

GIVON, TALMY. 1993. English grammar: A function-based introduction. Amsterdam: John Benjamins. 

KENNEDY, ROBERT F. Day of affirmation. Speech, Cape Town, South Africa, June 6, 1966. RFK in the 
land of apartheid. Available at http://www.rfksafilm.org/html/speeches/unicape.php. 

OBAMA, BARACK. Election night victory speech. Speech, Grant Park, IL, November 4, 2008. 
Available at http://obamaspeeches.com/E11-Barack-Obama-Election-Night-Victory-Speech-Grant-Park-Illinois-November-4-2008.htm 

---. Statement by the president on the school shooting in Newtown, CT. Speech, Washington DC, 
December 14, 2012, White House. Available at 
http://www.whitehouse.gov/the-press-office/2012/12/14/statement-president-school-shooting-newtown-ct 

SHORT, MICK. 1996. Exploring the language of poems, plays and prose. London: Longman. 


This article was first published at lacus.weebly.com. 


LACUS FORUM 


Journal of the Linguistic 
Association of Canada and the 


United States 

VOLUME 41 NUMBER 2, 2020 
CONTENTS 

FAUXPOLOGY AS A PARASITIC SPEECH ACT: 1 
EVIDENCE FROM PUBLIC DISCOURSE 
Keith G. Bateson, Pushp S. Narula, & Magda Stroińska 


CHALLENGING THE PRESCRIPTIVE-DESCRIPTIVE DIVIDE: 15 
AN ANALYSIS OF WORDREFERENCE FORUMS 
Michael D. Kliffer 


BUILDING MULTILINGUAL COMPETENCE AND THE 23 
“AWAKENING TO LANGUAGES” APPROACH: ONLINE AUTOMATIC 
TRANSLATION TOOLS IN MULTILINGUAL TEACHING MATERIAL DESIGN 
Eftychia Damaskou 


CLASSIFYING UNCLASSIFIED SPEECH ERRORS: 
AN EXAMINATION OF THE UNEXAMINED 
William J. Sullivan & Sarah Tsiang 


LACUS Forum 41:2.1-14 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free 
of charge and with no implied warranty. Its contents are available to you under the terms of the 
Creative Commons Attribution-NonCommercial license version 3.0; see 
http://creativecommons.org/licenses/by-nc/3.0/. 

Copyright © 2020 the Linguistic Association of Canada and the United States. 


FAUXPOLOGY AS A PARASITIC SPEECH ACT: 
EVIDENCE FROM PUBLIC DISCOURSE 
KEITH G. BATESON PUSHP S. NARULA MAGDA STROINSKA 
McMaster University 





Abstract: This paper argues that the apology is a performative speech act despite the lack of the rigorous 
conventional procedure demanded by John Austin. In recent years, there have been so many insincere and 
evasive apologies in the media that the blend fauxpology has been coined to categorize this type of non- 
apology apology. The paper argues that the media cuts and splices video clips of infelicitous apologies to create 
a felicitous apology and removes important sections of felicitous apologies to create misfires. 


Keywords: apology, speech act, Austin, fauxpology, performatives, media manipulation 


Languages: English, Classical Greek, French 


ACCORDING TO THE POPULAR DEFINITION, a fauxpology happens “[w]hen a person makes it sound 
like they are apologizing when, in fact, they are just shifting the blame or using twisted logic to argue their way 
out of responsibility for their actions.” This strategy is “[o]ften used by pro athletes, movie stars, and 
politicians when they get caught cheating, making absurd comments, or get arrested for drugs” (Urban 
Dictionary).¹ A fauxpology is thus a speech act that exploits the form of an apology, but then does not fulfill 
its mandate, serving, instead, the purpose of protecting the speaker from liability or other forms of possible 
litigation (Lazare 2005:173). One can note that British Columbia and some of the other provinces in Canada 
now have actual Apology Laws which prevent apologies being used as evidence in legal proceedings. 
Another name used to define fauxpology is “non-apology apology,” suggesting that, “with sufficiently artful 
double talk, you can get what you want by seeming to express regret while actually accepting no blame” 
(McCall 2001:1). The fauxpology can be considered a parasitic speech act (cf. Halion 1992) in that it depends 
on the prior existence of the apology model. The immediate question then is, what is an apology? 

Defining apology is, ironically, more problematic than defining the fauxpology. Many articles and 
books, for example Mea Culpa: A Sociology of Apology and Reconciliation (Tavuchis 1991), quote, in its 
entirety, the OED’s definition of the word apology, deriving it from the ancient Greek word ἀπό ‘away, off’ 
and λογία ‘speaking’, leaving us with ἀπολογία ‘defence’ or ‘a speech in defence.’ This use of ‘apology’ and 
‘defence’ is evident in both Plato’s Ἀπολογία Σωκράτους ‘The defence of Socrates at his trial’ and Xenophon’s 
lesser known Ἀπολογία Σωκράτους πρὸς τοὺς δικαστάς ‘Socrates’ defence to the jury.’ However, neither 
apology nor defence are in reality adequate representations of what is happening within Plato’s text. Plato’s 
report of Socrates’ speech would need to be contextualized within the framework of other contemporary 


¹ http://www.urbandictionary.com/define.php?term=fauxpology 




literature, for it quickly becomes apparent that Plato’s Socrates is full of insults against a number of 
individuals (Plato in Apology 84-5), is unrepentant on the topic of his behavior (111, 113, 129, 137), appears 
firm in his promise that he will never change his behavior (111), and asserts that he does not need “much 
of a defense” (103). This sequence assures us that Socrates’ apology is, indeed, a fauxpology. A further 
examination of these texts reveals how the notions of apology and defence quickly become obfuscated by 
personal motives unaligned with social repair or justice. These texts reveal competing views of one’s 
responsibility to the state and foreground how one’s speech acts and membership in a state have inescapable 
consequences. They assert that a social and political contract exists and has been accepted by Socrates (Plato 
in Crito 181). 


The term apology starts occurring in English literature in the 16th century with the Apologia of Syr 
Thomas More, Knight (1533) and An Apologia or Defense of our Days (1589) by F. Triage (OED). Philip Sidney’s 
Apology for Poesy (The Defense of Poesies)² is part of the Renaissance and Early English use of the apology 
as a form of attack arguing for the importance of the poet and his work, much as Socrates does. Both the 
ancient and Renaissance use of the terms and concepts of apology and defence are not totally alien to the 
modern usage, because an apology still involves an often extended explanation, such as Socrates’, if it is to 
be effective. In 1594 we find Shakespeare's Richard III giving evidence of the change in the use of apology to 
mean “[a]n explanation offered to a person affected by one’s action that no offence was intended, coupled 
with the expression of regret for any that may have been given,” and we also have the possible meaning of “a 
frank acknowledgement of the offence with expression of regret for it, by way of reparation” (OED). Socrates 
asks a fundamental question: “ought a man to do what he has agreed to do, provided it is right, or may he 
violate his agreements?” (Plato in Crito 175).³ 


1. THEORETICAL APPROACHES TO APOLOGY. Recent narratives dedicated to the topic of apology 
frequently rework or extensively quote the earlier Greek texts. We now also have an entire apology genre of 
short works targeting a general and business readership, such as The one minute apology (Blanchard & 
McBride 2003), The art of the apology (Bloom 2008), and The five languages of apology (Chapman & Thomas 
2008), which are fairly accurate in terms of the basic elements of an apology. On the other hand, Aaron 
Lazare’s (2005) On apology attempts a more cohesive and semi-comprehensive treatment of apologies, 
using a mostly anecdotal framework. William Benoit (1995) contributes a more rigorous treatment with 
Accounts, excuses, and apologies: A theory of image restoration strategies, focusing on the strategies people 
employ to minimize responsibility for their own actions. This work is notable for the tables giving a useful 
chronological listing of contributions to the scholarship in the category of theories of accounts, justifications, 
refusals, concessions, et cetera (51-61). Also, of particular interest in Benoit’s work is the inclusion of a 
critical listing of how scholars subcategorize these elements (51-61). An informative overview of apology is 
Robin Tolmach Lakoff’s (2001) paper, “Nine ways of looking at apologies: The necessity for 
interdisciplinary theory and method of discourse analysis.” Tavuchis (1991) has a narrower focus than 
Lakoff, while Nick Smith’s (2008) I was wrong: The meanings of apologies analyses apologies in order to 
differentiate the diverse conditions and fundamental issues more rigorously than most scholarship to date. 

The topic of apology, its rationale, its social and political context, and its philosophy is contextualized 
in terms of a theory of punishment in Christopher Bennett’s (2008) The apology ritual. This text may not 
lend much in the way of insight into the apology as a performative speech act, but it does provide a rigorous 


² Albert Feuillerat uses this title in his landmark edition The Complete Works of Sir Philip Sidney, vol. II. Cambridge UP, 1923. 


³ “In questions of right and wrong and disgraceful and noble and good and bad, which we are now considering, ought we to follow 
and fear the opinion of the many or that of the one, if there is anyone who knows about them...” (165). 




treatment of the act of apologizing in terms of retribution, responsibility, and restorative justice. Obviously, 
there are different stakes and consequences that derive from different apologies. The major issues include 
questions of responsibility and politics, especially when one considers historical offenses, compensation for 
victims, survivors, and descendants, and other potential liabilities. However, there are also fundamental 
problems with revealing how historical offences happened and the obvious resistance to acknowledging and 
taking responsibility for past crimes, especially those perpetrated by the state. This topic is rigorously 
attempted, particularly as a process, by Jennifer Lind (2008) in Sorry states: Apologies in international politics. 

Most specialized data on apologies can be found in journal articles, often with the focus on how the 
apology, or its lack, creates issues that leave a powerful imprint on human relations. The theme of heinous 
crimes associated with past wars or social conflicts constantly recurs as previously hidden and neglected 
details of past events are researched, uncovered, identified (denied), and eventually published. It may seem 
that an apology can do little to heal the wounds resulting from inhumane crimes, and yet many symbolic, 
sometimes silent, actions are performed, but usually not by the original offender and often not to the original 
victim. A great proportion of articles deal with the practical applications of apologies and how they are being 
dealt with in terms of legislation. For years, “[t]he law held statements admitting fault were admissible in 
civil action” (Newfield 2007:137). Therefore, clients (of “[d]efense lawyers, insurers, and risk managers, 
[who] wary of the exposure, frequently steered clients away from such admissions”), at least in health care, 
were advised not to “admit fault” because patients were allowed to testify that the “health care provider 
‘admitted’ having provided improper care following an incident” (137). Recently, there have been 
significant changes in how statements of sympathy and even apologies are treated by various legal systems. 
For instance, the so-called sympathy laws “provide an exception to the general rule regarding admissions, 
rendering statements of sympathy or apologies inadmissible in any civil action” (137). By 1986, the state of 
Massachusetts introduced “a law rendering statements of sympathy inadmissible in a civil action” (139).⁴ 
In Texas, California, Florida, and Massachusetts, “apology legislation provides that an expression of 
sympathy or regret is not admissible to establish liability; however, that part of an apology that contains an 
admission of fault or liability is either not covered by this law (137), specifically protected or is specifically 
excluded,” while other US states have stronger regulations protecting sympathetic expressions, regrets, and 
apologies (Macleod 2008: xv). In total, thirty-six American states do have apology laws.⁵ In Canada, “[t]he 
Uniform Law Conference of Canada and the Canadian Patient Safety Institute (CPSI) have both encouraged 
all provinces and territories to enact apology legislation.” Currently, all provinces except Quebec, New 
Brunswick, Yukon and the Northwest Territories have apology legislation in place.⁶ 


2. ELEMENTS OF AN APOLOGY. Erving Goffman (1971), drawing on the Radcliffe-Brown model 
(Radcliffe-Brown, 1952), insists that there are social norms that guide people’s actions and that these norms 
are reinforced by both negative sanctions providing “penalties for infraction, and positive [reinforcements] 
... providing rewards for exemplary compliance” (Goffman 1971:95). These sanctions may “be organized or 
diffuse.” Goffman upgrades these two categories into formal sanctions, as when a “specialized agency that 
has been officially delegated the sanctioning task acts in due response to a schedule of sanctions” and the 
“informal [sanctions] when the work is done locally, largely by the very person whose concerns have been 
jeopardized or by those who personally sympathize with him” (95-96). In the Radcliffe-Brown model 
“[n]orms or rules impinge on the individual in two different ways.” Norms may manifest “as an obligation 


⁴ Mass. Gen. Laws, chap. 233, sec. 23D, per Newfield 137. Note that “admissions of fault” are not covered by this law (137). 
⁵ See http://www.sorryworks.net/ for a full list of the states. This site also provides a list of news articles dealing with 
disclosure and the use of ‘sorry’. 


⁶ https://www.cmpa-acpm.ca/-/apology-legislation-in-canada-what-it-means-for-physicians 




that requires him to do (or refrain from doing) something in regard to others.” Conversely, the 
manifestations can also occur “as an expectation that leads him to anticipate righteously that something will 
be done (or specifically not done) by them in regard to him” (Goffman 1971:96-7). These penalties and 
rewards are significant “in what they proclaim about the moral status of the actor” (95). The media play a 
particularly significant role in creating the image of the actor. We will note, later in this paper, how the 
media often cut and splice certain categories of videos in order to actually inflate transgression and establish 
a pattern of acceptable or unacceptable behaviour - a certain flavour of “social personality.” 

Although people are often encouraged to shrug off such social aggressiveness on the part of the media, 
William Benoit (1995) asserts that “[t]hese attacks on our reputation are serious matters, for our image or 
reputation is extremely vital to us.” Certainly, our face helps with “a healthy self-image, but it also can create 
important favorable impressions on others,” and “a bad reputation may interfere with our interactions with 
others” (2). Benoit insists that “[w]hen such inevitable (apparent) misbehavior occurs, others are very likely 
to accuse, attack, berate, blame, censure, condemn, rail against, rebuke, or reproach us or object to our 
behavior” (1). The threatened image may compel us “to offer explanations, defenses, justifications, 
rationalizations, apologies, or excuses for our behavior” (2). 

Goffman’s coverage of remedial work aids in the analysis of apologies because the “function of remedial 
work is to change the meaning that otherwise might be given to an act, transforming what could be seen as 
offensive into what can be seen as acceptable” (1971:109). In Goffman’s model, restoring social relations is 
one aspect of restoring face, and, therefore, image restoration may also become a critical, alternative motive 
for making insincere apologies rather than a full apology. An insincere apology may restore social relations 
at less cost to face, without accepting responsibility, and, therefore, it helps to avoid the necessity of making 
amends. Remedial action is accomplished using “three main devices: accounts, apologies, and requests” 
(109). 

In relation to conventional procedures, Lazare (2005) insists that an apology is a process which begins 
with “acknowledging the offense” (76); it then moves on to “communicating remorse and the related 
attitudes of forbearance, sincerity, and honesty”; “explanations”; and finally, “reparations” (107). Lakoff 
(2001:202) observes that there are “many forms available for the performance of the single act of apology”. 
This does seem to contradict or at least undermine Austin’s insistence that for an utterance to be a 
performative there must be a conventional procedure that is executed through the speech act. Benoit (1995), 
referencing Goffman, barely deviates, except in language, and also asserts that the elements of an act of 
apology are an “expression of regret, acknowledgment of expected behavior and sympathy for the reproach, 
repudiation of the behavior and the ‘self’ committing it, promise to behave correctly in the future, and 
atonement and compensation” (Benoit 35). David P. Boyd (2011), in Art and artifice in public apologies, lists 
“seven sequential components of an apology: revelation, recognition, responsiveness, responsibility, 
remorse, restitution, and reform” (299). Marion Owen (1983) also uses Goffman, but stresses the 
turn-taking aspect of apologies (19-48). Jennifer Lind’s 2008 book Sorry states: Apologies in international politics 
foregrounds a different logic and asserts that conventional apologies, if they exist, are not necessary for 
reconciliation between countries that performed atrocities against each other (3-7). Such differences of 
opinion and models among scholars support the argument that apologies may be infelicitous by nature, 
because there is no single, conventional way to perform. Therefore, either apologies are not performatives, 
or they are a variety of apologetic forms that can be considered conventional. 


3. AUSTIN’S CONDITIONS FOR A SUCCESSFUL APOLOGY. Austin’s rules are the most quoted, often
in toto, and frequently referred to when dealing with performatives, because they are the closest thing we
have to a system capable of evaluating performative utterances. The apology is one of the “‘explicit’
performatives, and of that prepotent class later called ‘exercitives’” (1962:5, fn. 1). Elsewhere Austin refers


FAUXPOLOGY AS A PARASITIC SPEECH ACT 5 


to it as “of that class-the ‘masqueraders’” (4). The “performative sentence” is peculiar in that “the issuing
of the utterance is the performing of an action” (6), but the utterance must meet certain rules in order to be
considered “happy” or felicitous (15). Indeed, there are “some ... things which are necessary for the smooth
or ‘happy’ functioning of a performative (or at least of a highly developed explicit performative ...)” (14). The
essential condition associated with a felicitous performative, in this case the apology, is the achievement of a
specific “effect” (forgiveness), which depends on the existence of a “conventional procedure” that
includes a specific speaker (the offender), in a specific context, uttering specific words (26).

The felicity of the apology performative depends on avoiding each of Austin’s six categories of errors
(we can think of them as negative conditions of felicity). The four errors on the left side of Austin’s tree (see
Figure 1) are types of misfires (AB). The two errors on the right are forms of abuses (Γ). Fauxpologies are
infelicitous due to abuse by insincerity (Γ.1). However, fauxpologies can also, simultaneously, include
elements of misfires. Austin’s initial condition (A.1), the existence of a conventional procedure, is supported
first by the fact that both he (45-6) and Goffman (1971:119-20) classify the apology as a performative, second
by the close overlap in the literature identifying elements of apology (Benoit 1995:35), and finally by the
number of successful examples of apologies provided by the literature which conform to these same
elements. This seems to rule out the A.1 misfire, which Austin originally termed non-plays but later
labelled ‘?’ (18, fn. 1). If the speaker or context is wrong, then the apology misfires (AB) due to
misapplication (A.2) (28). Yet Austin also asserts that the procedure may be rejected, and some infelicities
“should be brought into our present class A.1 or rather into A.2 (or even B.1 or B.2)” (27-8), thus
demonstrating some flexibility in assigning these errors to categories, and allowing for occasional
instances of overlap. It is insufficient for the apology just to be accurately performed by the offender;
it is also necessary that all the participants execute their parts fully and correctly, otherwise the performative
misfires due to one or more flaws (B.1) or a hitch (B.2), as when the complete sequence is aborted at some
point (14-5). The non-acceptance of the apology by the offended is a flaw, as is a lack of
“consensus ad idem” between the parties (36). Any of these misfires makes the performative void (18, 39). In
order for the apology to be happy/felicitous, the speaker must also avoid infelicities due to abuses, and,
therefore, the apologizer must possess the appropriate feelings, be thinking the correct thoughts (honesty),
and possess the required intentions. The lack of any one of these features creates an insincerity abuse (Γ.1) (40).
The final condition of the apology performative is that the speaker intends to do what the utterance
commits them to do; usually this means refraining from committing the offence again (43-44).


Infelicities
    AB. Misfires (act purported but void)
        A. Mis-invocations (act disallowed)
            A.1 ?
            A.2 Mis-applications
        B. Mis-executions (act vitiated)
            B.1 Flaws
            B.2 Hitches
    Γ. Abuses (act professed but hollow)
        Γ.1 Insincerities
        Γ.2 ?

Figure 1. John Austin’s infelicity tree for performative utterances (1962:18)

The positive conditions associated with the felicitous apology, therefore, include the true offender being 


6 KEITH BATESON, PUSHP NARULA, &MAGDA STROINSKA 


in the presence of the offended (context and correct people), addressing the offended directly with the 
appropriate mixture of contrition for the offending act, sincerity in stating remorse, and a promise not to 
perform the offence again. Further, the offender needs to make a statement of the details of the offence, 
with some type of offer of reparations, and either in private or public, depending on the victim’s preference, 
beginning with the offender stating, with a suitable tone of voice, “I apologize for ...” The apologizer’s full 
and appropriate statement must include asking the victim for forgiveness and having the victim 
acknowledge (accept) the apology, for “the procedure must be executed by all participants both correctly 
and completely” (14-15). 
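
The six categories of Figure 1 can also be laid out as a simple lookup, which makes explicit how a single utterance may carry several infelicities at once. The following Python sketch is purely illustrative and is not part of Austin’s account: the names Infelicity, INFELICITIES, and assess are hypothetical, the category labels follow Austin’s lettering (1962:18), and the rule that any misfire outweighs an abuse is an assumption adopted here for simplicity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Infelicity:
    code: str    # Austin's label, e.g. "B.1"
    name: str    # "?" where Austin left the category unnamed
    branch: str  # "misfire" or "abuse"


# The six leaves of the infelicity tree (Austin 1962:18).
INFELICITIES = {
    item.code: item
    for item in (
        Infelicity("A.1", "?", "misfire"),
        Infelicity("A.2", "mis-application", "misfire"),
        Infelicity("B.1", "flaw", "misfire"),
        Infelicity("B.2", "hitch", "misfire"),
        Infelicity("Γ.1", "insincerity", "abuse"),
        Infelicity("Γ.2", "?", "abuse"),
    )
}


def assess(codes):
    """Summarize an utterance from the infelicity codes an analyst assigned.

    Illustrative rule (an assumption, not Austin's wording): any misfire
    makes the act void; otherwise any abuse makes it hollow; an utterance
    with no infelicities counts as felicitous.
    """
    found = [INFELICITIES[code] for code in codes]  # KeyError on unknown codes
    branches = {item.branch for item in found}
    if "misfire" in branches:
        return "void"
    if "abuse" in branches:
        return "hollow"
    return "felicitous"


print(assess([]))              # felicitous
print(assess(["Γ.1"]))         # hollow
print(assess(["B.1", "Γ.1"]))  # void
```

The last call corresponds to an apology coded as both flawed (B.1) and insincere (Γ.1); how such overlapping cases should be weighed is exactly the kind of classificatory latitude that Austin himself allows.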

We shall now turn to some case studies, presenting recent examples of public apologies and their
analyses in terms of performative speech acts.


4. APOLOGY AS A PERFORMATIVE SPEECH ACT: TWO CASE STUDIES. We use John Austin’s
performative tree to analyze both the apology as a performative and the fauxpology, a non-apology apology,
as a non-performative. Austin’s assertion that the apology is a performative speech act (1962:46) may be
problematic because for a speech act to be a performative utterance, a conventional procedure must exist
for its execution (26). This requirement is not as straightforward as one would initially think.
Similarly, J. O. Urmson asserts that Austin made a minor error when he included the “I do” speech act of 
the marriage ceremony as an example of a performative (5, fn. 2). Austin also maintains that the list of 
infelicities is incomplete (24), that “different infelicities can be combined or can overlap”, and that there 
may be a certain optionality in classifying “some given particular example” (26). Even the labels for 
categories of infelicity are mutable (18, fn. 1). 


4.1. DATA SET I: TORONTO MAYOR ROB FORD. Rob Ford, the mayor of Toronto (2010-2014), became 
notorious in Canadian media, not just for denying many allegations of drug and alcohol abuse and ranting, 
but also for the significant number of his infelicitous apologies that culminated in the infamous December 
16, 2013 clip where he seemed to be naively asking Frances Nunziata, the Speaker of the Toronto City 
Council, what an apology was in a protracted, seemingly inept sequence. Ford starts to blurt “I apologize,”
then “I’m sorry,” and finally launches into an “I am super, super, super ... sorry” apology, which, of course,
went viral. In light of our experience with media editing of visual records (see Data Set II for more details), we
sought inconsistencies amongst multiple variants of what seemed to be Ford’s latest fauxpology. In the case
of Rob Ford, we sought the missing clips, reassembled them into the correct sequence and transcribed the
results. We discovered that the “super, super ... sorry” pantomime, which previously appeared isolated and
added up to a humiliating representation of Ford, omitted or neglected a key fact: the Speaker had already
accepted Ford’s earlier apology, as is evident from the earlier segment (often omitted by the media):


Speaker: Ahmm, Mayor Ford, last night you made some disparaging 
comments regarding members of council as well, ah, being 
corrupt. So Mayor Ford I am am asking you to apologize to
members of council please 
... Ah, just a sec ... Okay? 


Ford: Yeah, I would have rather run first but I go second. Ah, ahh. I 
withdraw my comment saying that. I think I said, ah, I think I 
said, ah, that only the corrupt councillors can stay. I withdraw 
that comment. 


Speaker: Okay, thank you. 


[palpable muttering in council]
Unknown voice: [missing] ... apologize ...
Speaker: See. He apologized. He apologized ... I’m sorry, apologized for that comment.
[mutter]⁷


On the one hand, the mayor and the Speaker are following appropriate parliamentary
procedure, and the Speaker is the appropriate person to accept Mayor Ford’s apology on behalf of the
council; she does so, and thus Ford’s apology is rendered felicitous. On the other hand, after Ford’s words
are accepted as an apology by the Speaker, a councillor objects out of turn, ignoring procedures and triggering
the “super, super sorry” response from Ford.

It is here that Councillor Parker interjects with a point of order, stating that he wants Ford “not [to] just
withdraw the remark but to withdraw the implication and innuendo that was [...] contained in the remark,
but it wouldn’t hurt to add an apology as well.”⁸ There is no formal call for an apology here. It is the Speaker
who changes the felicity of the previous apology by suddenly reversing her acceptance with: “Okay, mayor
Ford would you like to try it again,” and “What the word is, is that you need to say apologize.” It may be
worth noting that Councillor Parker avoids the damage to face that would accrue if the challenge failed (see
Goffman’s theory (1967:27-28) in Owen (1983:15)) by phrasing it in terms of an indirect afterthought with
“it wouldn’t hurt”.

This representation of Ford, although it may be accurate in many ways, is skewed by a mediated
contextualization that the media construct by deleting the initial sections and by cutting and splicing clips. The
salience of the clip with Ford’s angry rant dictates that his apology is considered a Γ.1 fauxpology (i.e. abuse,
insincerity). However, the fuller transcript reveals that Ford had in fact withdrawn the previous evening’s
comment to the satisfaction of the appointed Speaker, and that this was accepted as an apology within the
conventions of the meeting. Parker’s speech and its misinterpretation by the Speaker as a call for another
apology, combined with the audience’s chanting, create a situation where all the actors misunderstand what
is required of them. Parker wants a withdrawal of the comments. The Speaker rules that for Ford to apologize
he must withdraw his comments. Ford rightly says that he has withdrawn his comments. There
is confusion and competing views as to what constitutes an apology for all the people at this meeting. Initially
the Speaker declares that Ford has apologized, then is persuaded that he has not. There does not seem to be a
generally agreed upon conventional procedure for apology in this venue, and therefore, in Austin’s terms, an
apology cannot be seen as a proper performative speech act, at least in this case. This raises theoretical
questions about the felicity conditions for an apology in the public space.


4.2. DATA SET II: THE NORTH YORK GENERAL HOSPITAL. On November 19, 2013, Justice McCombs
ruled the anaesthesiologist Dr. George Doodnaught guilty on twenty counts of sexual assault that occurred
at the North York General Hospital (NYGH) in Toronto.⁹ Following the ruling, the C.E.O. of the Hospital,
Tim Rutledge, held a press conference.¹⁰ The video segment of the press conference posted on the Hospital’s
website evoked mixed reactions among the three authors, who were divided in their impressions. Was the
C.E.O.’s speech a felicitous or infelicitous apology, or even a fauxpology? When the video was played back, a
slight visual flicker was noted, raising the suspicion that the video material had been edited, most likely to
reduce segment time. Further investigation gave different results. Longer video sections were examined
using Camtasia© software (video/audio) to detect visual editing, then both Camtasia© and SONAR© to
examine and compare the audio tracks, and finally Corel VideoStudio X7© to compare results and attempt to
generate similar re-splicing capabilities. Obviously, the professional television stations have more
sophisticated software, but even such equipment and experts have their limitations. The audio tracks from
the press conference appeared seamless, but they had to be synchronized with the video footage in order
to avoid ending up with bad lip syncing. While the human ear can detect an audio glitch, viewers are poor
at detecting very minor cuts and splices in the visual stream because of limited attention span, eye movement,
and other factors. This means that editing tends to leave traces in the visual stream that emerge only under
close analysis. The software tests and script comparison allowed us to reconstruct the original speech, which
included some sections of the question and answer period. The reconstruction also helped identify additional,
nearly imperceptible jumps, which we interpreted as indications of editing.

⁷ http://globalnews.ca/news/1035273/rob-ford-super-sorry-for-comment-made-at-Mondays-council-meeting/
⁸ Dec. 17, 2013 6:55 p.m. (http://globalnews.ca/news/1035273/rob-ford-super-sorry-for-comment-made-at-Mondays-council-meeting/)
⁹ http://globalnews.ca/news/1170480/toronto-anesthesiologist-doodnaught-to-be-sentenced-for-sexual-assault/
¹⁰ http://www.nygh.on.ca/Default.aspx?cid=2572&lang=1

What was at stake here was the status of the NYGH apology in terms of its felicity. When we spliced the 
various sections of the three versions posted on various reputable news and hospital websites together, it 
became obvious that the news media had taken Rutledge’s speech (that sounded a lot like a fauxpology) and 
inserted and rearranged different clip sections, thereby, perhaps inadvertently, creating what appears to be 
a felicitous apology. The fact that Rutledge was wearing the same necktie throughout the published video 
suggests that the clip is from the same day, just a different part of the press conference. The original version 
is infelicitous because the speaker focused on how “proud of the high quality of care, and safe care that we 
provide at North York General” he was, and created further distance from the assaults by focusing on details 
of the trial and how the hospital was changing policies. The result, working down Austin’s infelicity tree,
suggests that the speech was a misfire, specifically a mis-execution, and that it was flawed. We also identified
possible elements of abuse (act professed but hollow) due to the perceived insincerity of the statement, which
focused on the positive rather than fully acknowledging the harm done. On this analysis, the apparent
apology is, in fact, a fauxpology.

The C.E.O.’s apology is made acceptable through the insertion of “and on behalf of the hospital I 
sincerely apologize...” and the deletion of a large section whose emphasis was on the strength of the hospital 
and the hospital’s shock that “trust was broken by a doctor who worked here.” This creates a distancing effect 
with its implied refusal to take responsibility. This distancing effect is further reinforced by extensive 
sections dealing with changes to hospital policies, which seem to be only deferred modifications couched 
in terms of planning. The inserted section (“I sincerely apologize”) modifies what is perceived as an 
insensitive, flawed, and insincere speech act — a non-apology apology. Thus, in this case, the fauxpology 
offered by the speaker is deconstructed and reassembled by the media to create (not necessarily on purpose) 
a somewhat felicitous apology. 

When the C.E.O.’s apology is placed in the context of the NYGH’s ongoing media relations, one first
discovers that it is difficult to connect this attempt at apology with a pre-existent series of promotional
videos. A Feb. 18, 2014 clip, “Tim Rutledge: Making a World of Difference,” is the last part of an eight-part
series started in April 2013. In this video,¹¹ Rutledge wears a bold gold paisley tie versus the business-like
diagonal stripe one of the November 19, 2013 clip.¹² He seems heavier. The hair is lighter, and his lower
face sports a sparse, light colored mustache and goatee. In the promotional clip of February, gone is the
expensive business suit, replaced with a cheaper substitute. He has dispensed with his glasses, and is doing
a casual walking tour of the hospital shot from the front. It is great public relations, but the timing, one
week before the sentencing of Dr. Doodnaught (February 25, 2014),¹³ seems too coincidental, almost
unlikely. The release date weakens the salience of the rearranged press conference and pushes the pseudo
apology further into the Γ.1 (insincerity) category. Still, as Zohar Kampf (2013:150) points out, “[I]t is quite
exceptional to find a felicitous apology in the public context.”

¹¹ www.nygh.on.ca/Default.aspx?cid=3057&lang=1
¹² https://www.youtube.com/embed/-lagfO1IMf8. Redirected from http://www.nygh.on.ca/Default.aspx?cid=2374&lang=1

In the case of Ford’s accepted apology, which is then transformed into an infelicitous performative, one 
must take into account how “[t]he mediated conditions in which an apology is realized are yet another 
stumbling block in the way of admitting wrongdoing” (Kampf 2013: 150). As Kampf further notes, a 
number of relevant processes are at work in public mediated apologies, especially how “journalistic 
construction of melodramatic narratives around apologies in many ways recalls the structure of social 
dramas” (156). He believes that the reconstruction of apologies and their contexts by journalists is partially
motivated by their desire to “contribute to [the] intensification of conflicts” (155) in order to create an 
audience, but the process also establishes the journalists as agents performing the work of actively 
investigating, articulating, revealing and communicating the transgressions of social norms by political or 
public actors. The biased selection of video clips unfavorable to viewing Ford’s apology as serious supports 
the view that “[j]ournalists, as representatives of the public in seeking cues to sincerity, may frame apologetic 
gestures as hollow by criticizing their infelicity” (158). A more cynical interpretation would be that 
“apologies may [not only] serve as a means of legitimizing journalistic work” but become a way of enforcing 
societal norms and expectations. This, Kampf says, lends the journalists an air of “temporary superiority 
over political actors throughout the conflictual process” (155). 


5. PRIVATE VERSUS PUBLIC APOLOGIES: MEDIA MANIPULATION. If Ford’s apology had been 
performed in private, one could easily see that, even if not adept, at least he was capable of a felicitous 
apology. In one sense Ford’s apology is a private apology, because it is issued to one person, the Speaker,
but her deputy (Councillor Parker) interjects with a specific request for a more explicit withdrawal while
the audience calls for an apology. This demonstrates that “[w]hen remedial acts like apologies are extracted 
from the private sphere and issued in the mediated public domain, the logic of their realization is changed” 
(Kampf 2013: 150). Further, the insertion of the mutters and demands from other attending councillors, 
staff, and the public typifies how “the media plays a crucial role by transferring words to heterogeneous 
audiences, each having different normative assumptions” (150). In this case, the normative assumptions are 
the varying conceptions of what constitutes the conventional method of uttering an apology. If a group of 
politicians and their staff in a metropolitan city are unable to agree on what the convention is, other than to 
reiterate the word “apology” in a loud voice, then one should question not only what constitutes a 
conventional method but also whether the apology is a performative. If stating a sincere apology when it is 
appropriate, in this case after issuing a joking remark, to the appropriate person and having it accepted by 
the appropriate person, the Speaker, does not constitute a satisfactory performative speech act, then 
Austin’s conceptualization or at least the demand for a conventional procedure needs to be reexamined. 

It seems likely that the apology, especially when performed in public, can still be a performative, but it
appears that either no one knows the correct procedure or there is another factor informing the felicity of
the performative. Separating the conventional procedure into different intensities and contexts of offences
may resolve this issue. For instance, Kampf in his “discourse analysis of 1,020 apology-related utterances
reported by the Israeli press between 1997 and 2004” distinguishes two different contexts: the “slight and
everyday offences” and the “high-profile apologies for historical crimes” (2013:149). He also observed
contexts such as “severe short-term transgressions, such as apologies following the killing of civilians” or a set
of “mid-range transgressions of societal norms such as those provoking political or diplomatic scandals”
(149).

¹³ www.cbc.ca/news/canada/toronto/dr-george-doodnaught-gets-10-years-for-sex-assaults-1.2550308.

Of particular interest in relation to the Ford “super apology” is Kampf’s fifth type, “transgressions 
within a specific professional community, such as disloyalty between peer politicians” (149). The latter is 
relevant here because Ford’s initial quip that “the more corrupt ones [councillors] can stay”¹⁴ destroys the
notion of equality inherent in the term “peer” by dividing the councillors into two groups: those of proper
moral standing and those who are corrupt. The jocular comment is the original offence for which the Speaker
calls for an apology. Ford defends himself with “I didn’t name names.” One must note here that at the time
of the original joke the council meeting had been out of control for quite some time (and this is the unreported 
context of Ford’s outburst). 

Ford’s fauxpology, when classified in terms of Austin’s infelicitous performatives, appeared flawed (B.1)
and insincere (Γ.1) but only in the context of a media-manipulated apology. The retraction of previous
comments was, in itself, a performative speech act that was infelicitous in that it misfired. However, this is 
only apparent when a full transcript is merged, covering the end of one meeting and the beginning of the 
following. What is certainly missing in media coverage of Ford’s apparent fauxpology is the full 
contextualization of Ford’s quip. That utterance might be less of an offence than the media clipping suggests. 
Also, the more complete context might reveal what motives and agendas were in play. 


6. APOLOGY AS A PERFORMATIVE SPEECH ACT. Both the NYGH and Ford reworked video clips 
foreground essential elements and requirements of the apology: the apology must follow a number of 
seemingly precise conventions if it is to be, in John Austin’s formulation, a felicitous performative, i.e. a 
speech act that performs an action. It must also be done completely. While we may suggest that Austin is 
unrealistically rigorous in his initial model of the apology as a performative speech act, the paradigm is an 
integral part of what must constitute any performative, whether an apology or the christening of a ship for 
its launching. This insistence on a conventional procedure in order for a performative to exist, and thus for an
apology to be considered a performative, must always come back to the question: “What is the conventional
procedure for the apology?” A not so obvious difficulty is that, until recently, very few
researchers could articulate what they believed to be the necessary and complete elements of a felicitous
apology. Does this mean that it was, until recently, difficult or impossible to define or evaluate when an
apology is a performative and, if so, whether it is felicitous? The work of Kampf and others suggests that there
are degrees of intensity of offences, and, therefore, the similitude of an apology must vary accordingly. In
cases of heinous crimes, it seems unlikely that any apology would do other than misfire.

In some cases, misfired apologies have been identified and “proper” procedures articulated by the
offended parties. For instance, in May 2014 Manitoba First Nations Chief Arlen Dumas refused the Laurie
River Lodge apology for printing in their brochure that “like all Native North Americans, they [First Nations
guides from the town of Pukatawagan] have a basic intolerance for alcohol.” He clearly stated what, for him,
a felicitous apology should consist of: the Lodge must “offer a public apology to all Cree and Indigenous
peoples; write a personal apology to each Indigenous employee; and find a way to make amends to Mathias
Colomb Cree Nation.”¹⁵ Yet, despite this apparent clarity, the chief still cannot be assumed to list an
accurate, exhaustive conventional procedure for all apologies. Nevertheless, the Manitoba case is a clear
instance of an apology that misfired, and it foregrounds that apologies can be refused.

¹⁴ http://globalnews.ca/news/1035273/rob-ford-super-sorry-for-comment-made-at-Mondays-council-meeting/ (at 57
second mark).
¹⁵ http://www.cbc.ca/news/canada/manitoba/chief-rejects-fishing-lodge-owner-s-apology-over-racist-brochure-1.2657927.

There are also instances of apparently felicitous public apologies, such as the apology issued on August
18, 2010 by John Duncan, the Minister of Indian Affairs and Northern Development, on behalf of the
Government of Canada and all Canadians, for the Inuit relocation to the “high Arctic desert”. Several Inuit
families were moved in the 1950s some 1200 km north of their original residence, without the provision of
proper shelter and with little food. The media did not report how many of them died in the process or even
whether the well-worded apology was accepted by the survivors of the relocation or their children.¹⁶ Even
the felicity of such a well-publicized public apology as that given on June 11, 2008 by Prime Minister Stephen
Harper “to former students of Indian Residential Schools” seems unknowable because the media focused
on the speech given in the context of Canadian Parliament and wildly applauded by the MPs. The media did
not offer coverage of whether the affected First Nations survivors gave forgiveness. Also missing was an
apology to the parents of those kidnapped students.¹⁷

How does one judge the felicity of an apology offered in the April 11, 2013 letter by Gord Nixon,
President and Chief Executive Officer of the Royal Bank of Canada, who was addressing the issue of the loss
of Canadian jobs because of the Bank outsourcing jobs to India?¹⁸ On April 11, 2013, the Globe and Mail
website included the RBC apology, but also posted the story “RBC apology met with skepticism on social
media” alongside the apology,¹⁹ which suggests that the apology misfired. While the Globe seemed to
provide coverage that focused on outsourcing as a common and legal practice in Canada, the Star included
coverage that was more sympathetic to the workers but also stressed that the apology only “comes at the
end of a week of drubbing from RBC customers and labour critics.”²⁰ What is observable is that the apology
was only issued after the media revealed the problem. The delay and the fact that the apology was only issued
after pressure suggest an infelicity: an abuse due to insincerity (Γ.1). Some apologies are only sparsely
reported, such as Michael J. Fox’s apology to Taylor Swift over a remark he made. The apology was issued
and accepted but the details remained between the two celebrities.

No coverage of fauxpologies in Canadian media would be complete without at least making a nod
toward Justin Bieber and his long list of both serious and asinine offences, which include insulting Bill
Clinton,²¹ kicking the Argentine flag off the stage,²² showing up late for his London show,²³ and vomiting
on stage.²⁴ Yet, an analysis of Bieber’s apologies in terms of Austin’s model tends to reveal a certain degree
of felicity concerning his performative speech acts. A fauxpology is in its clearest form an apology that is
infelicitous because it is in the abuse category. We can argue that it is a Γ.1 fauxpology (abuse, insincerity).
Although it may seem counter-intuitive, Bieber appears to be sincere. Taken individually, his offences are
usually not repeated. True, he acts young and foolish but one also needs to look to the ever present
media that expects and encourages such bad boy behavior. If Bieber repeats the offence then, in Austin’s
terms, his apology for the first misbehavior needs to be considered an abuse. On the other hand, we could
take his offences as a pattern. To repeat Goffman’s paradigm: “[a]ny deviation ... on any one occasion when
the rule is supposed to apply can give the impression that the actor may be delinquent with respect to the
whole class of events” (1971:97). Yet, in terms of Goffman’s social structure Bieber is the epitome of a social
personality, and because this “position [is] occupied by a human being in a social structure, the complex
formed by all his social relations with others” (193-94) is media mediated and even encouraged. The celebrity
context illuminates further the multiple roles the media plays in calling for an apology, setting the stage,
applying pressure, constructing a narrative, publicizing and reaping the benefits of a complex process.

¹⁶ http://www.the-cma.org/about/blog/the-apology-and-the-world-of-social-media;
http://www.aadnc-aandc.gc.ca/eng/1100100015397/1100100015404
¹⁷ http://www.pm.gc.ca/eng/news/2008/06/11/pm-offers-full-apology-behalf-canadians-indian-residential-schools-system#sthash.UqdQ9L60.dpuf
¹⁸ http://www.rbc.com/newsroom/news/2013/20130411-rbc-statement.htm
¹⁹ http://www.theglobeandmail.com/report-on-business/rbc-apology-met-with-skepticism-on-social-media/article11063200/?from=11061489
²⁰ http://www.thestar.com/business/2013/04/11/rbc_chief_issues_open_letter_apology_to_canadians_over_outsourcing.html
²¹ http://www.hollywoodreporter.com/news/justin-bieber-apologizes-bill-clinton-583902
²² http://hollywoodlife.com/2013/11/14/justin-bieber-apologizes-kicking-argentinian-flag-off-stage-twitter/
²³ http://www.billboard.com/articles/news/1550613/justin-bieber-apologizes-to-london-fans-for-show-delay-citing-technical-issues
²⁴ http://www.cbc.ca/news/arts/justin-bieber-apologizes-to-fans-for-vomiting-on-stage-1.1212535


7. CONCLUSION. As Kampf (2013:158) observes, “the greatest challenge is to differentiate between different
types of infelicitous speech acts in an age of apology”. The media have always played a persuasive role in
social relations, especially in political systems that tolerate freedom of speech, and they are also playing an
increasingly significant role in public apologies. One such example is the reworking of video clips of public
figures apologizing for their behavior in order to create a different salience. One factor that enables this 
influence is the breadth of events the media can draw from in global, national and local contexts. The easy 
access to mobile phones with their video capabilities allows almost anyone to upload video clips to various 
websites. These video clips are often taken from misleading angles and are truncated or edited so that the 
context of events cannot be known, and, therefore, may be easily manipulated to support various agendas. 
The new generation of software, which is often free, permits near seamless cuts and splices resulting in a 
reordering of events and creation of virtual pseudo-realities. The quick turnaround time from event, to 
information gathering, manipulation and publication to various media sites discourages the analysis of the 
video’s material and makes it futile. Unlike written texts, sliced and diced videos which often feature intense 
or absurd pantomimes or verbal exchanges also resist analysis because the original sequences are unavailable 
for analysis until much later, if at all. 

We have tried to show that there is no agreed upon, conventional format of apology. While this may 
occasionally cause problems in interpersonal communication in the private domain, with some good will 
people are usually able to sort out their differences, recognize the hurt they have caused and apologize in a 
way that satisfies the expectations of the offended party. However, when government officials, celebrities or 
public figures and the media are involved, the apology is usually staged, recorded, and the video and audio 
footage may be easily manipulated to fit particular agendas. If the purpose of such manipulation is to make 
news rather than make repairs, or if the speaker produces an apology in order to shift the blame or avoid 
liability, the apology will be infelicitous. It can be a misfire if the execution is flawed, or it can be an abuse 
if the speaker is insincere. In the latter case, the apology becomes a fauxpology, the phenomenon that gave 
rise to this study. There are many questions that could be investigated in this context: for example, the effects 
of culture or gender on the performance of apologies, both in private and in public.* A particularly serious 
question that we have posed but have not answered fully is whether there exists “an accepted conventional 
procedure having a certain conventional effect” and including “the uttering of certain words by certain 
persons in certain circumstances” (Austin 1962). If there is no agreement when an apology is felicitous, the 
reason may simply be that there is no one script for a successful apology and that it is not a true performative 
speech act. If so, perhaps we could accept that one who says that he is “super, super, super sorry”, but feels no
remorse and will reoffend, is simply lying. 


*° We would like to thank the LACUS reviewers for this insightful suggestion. 


FAUXPOLOGY AS A PARASITIC SPEECH ACT 13 


REFERENCES 


AUSTIN, JOHN. 1962. How to do things with words. Oxford: Clarendon Press. 

BENNETT, CHRISTOPHER. 2008. The apology ritual: A philosophical theory of punishment. Cambridge:
Cambridge University Press.

BENOIT, WILLIAM L. 1995. Accounts, excuses, and apologies: A theory of image restoration strategies. Albany:
State University of New York Press.

BLANCHARD, K., & M. MCBRIDE. 2003. The one-minute apology. New York: Harper Collins.

BLOOM, L. M. 2008. The art of the apology: How to apologize effectively to practically anyone. Springfield, VA: 
Green Angel Media. 

BOYD, DAVID P. 2011. Art and artifice of public apologies. Journal of Business Ethics. 104 (3):299-309. 

CHAPMAN, G., & J. THOMAS. 2006. The five languages of apology. Chicago: Northfield. 

FAUXPOLOGY, (nd) Urban Dictionary. Accessed on March 20, 2014 
http://www.urbandictionary.com/define.php?term=fauxpology 

GOFFMAN, ERVING. 1967. On face work. Chapter 1 in Interaction ritual: Essays in face-to-face behavior.
Chicago: Aldine.

---. 1971. Remedial interchanges. Chapter 4 in Relations in public: Microstudies of the public order. New
York: Basic Books. 

HALION, KEVIN. 1992. Parasitic speech acts: Austin, Searle, Derrida. Philosophy today. 36.2:161-72. 

KAMPF, ZOHAR. 2013. The discourse of public apologies: Modes of realization, interpretation and 
mediation. In Public apology between ritual and regret: Symbolic excuses on false pretenses or true
reconciliation out of sincere regret?, ed. by Daniël Cuypers, Daniel Janssen, et al., 147-65. Amsterdam,
New York: Rodopi. 

LAKOFF, ROBIN TOLMACH. 2001. Nine ways of looking at apologies: The necessity for interdisciplinary
theory and method in discourse analysis. In The handbook of discourse analysis, ed. by
Deborah Schiffrin et al., 199-214. Oxford: Blackwell. 

LAZARE, AARON. 2004. On apology. Oxford: Oxford University Press. 

LIND, JENNIFER. 2008. Sorry states: Apologies in international politics. Ithaca: Cornell University Press. 

MACLEOD, LESLIE H., & Associates. 2008. A time for apologies: the legal and ethical implications of 
apologies in civil cases; Cornwall Public Inquiry Phase 2, Research and Policy Paper. April 12, 2008. 
Toronto. Accessed on November 15, 2014 at 
http://www.attorneygeneral.jus.gov.on.ca/inquiries/cornwall/en/report/research_papers/Phase_2_RP/3_Macleod_Apologies.pdf.

MCCALL, BRUCE. 2001. The perfect non-apology apology. New York Times, April 22, 2001, 2. 

NEWFIELD, JAMES S., ESQ. 2007. The evolution of the apology. Legal Issues. Home Health Care 
Management & Practice. February 2007 Vol. 19, Number 2, 137-39. Sage Publications. 

OWEN, MARION. 1983. Apologies and remedial interchanges: A study of language use in social interaction. 
New York: Mouton. 

PLATO. 2001. Apology. LOEB Classical Library. Trans. by Harold North Fowler. Cambridge, MA: 
Harvard University Press. 

---, Crito. 2001. LOEB Classical Library. Trans. by Harold North Fowler. Cambridge, MA: Harvard 
University Press. 

---, Euthyphro. 2001. LOEB Classical Library. Trans. by Harold North Fowler. Cambridge, MA: Harvard 
University Press. 




RADCLIFFE-BROWN, R. 1952. On the concept of function in social science. Structure and function in primitive 
society: Essays and addresses. New York: The Free Press.
SMITH, NICK. 2008. I was wrong: The meanings of apologies. Cambridge: Cambridge University Press.
TAVUCHIS, NICHOLAS. 1991. Mea culpa: A sociology of apology and reconciliation. 
Stanford: Stanford University Press. 


This article was first published at lacus.weebly.com. 





LACUS Forum 41:2.15-22 (2020)






LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free of
charge and with no implied warranty. Its contents are available to you under the terms of the Creative
Commons Attribution-NonCommercial license version 3.0;
see http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 


CHALLENGING THE PRESCRIPTIVE-DESCRIPTIVE 
DIVIDE: AN ANALYSIS OF WORDREFERENCE FORUMS 


MICHAEL D. KLIFFER 
McMaster University 


Abstract: This paper argues that, contrary to received linguistics wisdom, prescriptivism is not always worthy 
of condemnation, nor is descriptivism the prerogative of linguists only. I first discuss some linguists’ views 
arguing for a less dismissive stance towards prescriptivism, e.g. Milroy’s contention that any accurate
description contains a prescriptive dimension because it must correspond to prevailing, non-puristic norms.
I then give examples of the balanced treatment seen with non-linguist moderators of the site WordReference
regarding French constructions which manifest a conflict
between usage and prescriptive norm. Each of the 10 threads examined was classified according to 7 criteria, 
with the results showing a clear preference for the descriptive approach, i.e. not appreciably different from 
what we would expect from a study by linguists. 


Keywords: prescriptivism, descriptivism, register, norm, purism 


Languages: French 


THIS PAPER WILL ARGUE that the classical linguistics position requiring our discipline to dismiss 
prescriptivism needs rethinking. Contrary to the view that any serious linguistics theory must have a purely 
descriptive foundation, I will make the case that the division between description and prescription is not 
watertight. Evidence from activities which involve both linguists and non-linguists suggests that we will 
never fully understand language, whether synchronically or diachronically, unless we adopt a holistic 
perspective in which normative and usage-based approaches are not mutually exclusive. The empirical 
contribution of this paper will involve a look at threads by non-linguists in a WordReference French forum. 
The general impression of these contributions will be one of balance between observed usage and the dictates 
of language authorities, a stance which, I believe, gives credibility to these non-linguists’ assertions. 


1. THE CLASSIC VIEW. Typically, the first point in an introductory linguistics course is that our discipline 
is descriptive; this is what distinguishes it from traditional prescriptive grammar. I will avoid a rehash of 
the arguments supporting a descriptive stance, except to say that some introductions, perhaps the majority, 
leave the student with a black-and-white impression: prescriptivism invariably distorts the facts of language 
while descriptivism ensures observational and, ultimately, explanatory adequacy. This rejection of 
prescription is largely valid for fieldwork and analysis, but the latter two areas hardly exhaust the domains 
which involve our colleagues. Linguists also participate in language planning and consultations on 
standardization, whether in reforms to language teaching or the drafting of norms for yet to be codified 




languages. Such contributions are by their very nature prescriptive, at least in part. Moreover, an 
introductory linguistics course may imply that descriptivism is what separates the linguist from the 
language dilettante, whom our profession tends to dismiss as a reactionary and/or ignoramus. 


2. FURTHER ARGUMENTS AGAINST THE DIVIDE.


2.1. EMPIRICAL EVIDENCE. In sociolinguistics, register can be both descriptive and prescriptive. 
Descriptive because it is based on direct observation of usually spoken data or corpora. Prescriptive because 
in a second language pedagogical context, an awareness of register can guide the student to make formality 
choices that correspond to what a native speaker would opt for. Register is perhaps the most obvious case 
where prescription can be in sync with observation and description. For grammatical or phonological 
phenomena that vary by register, the same close connection can and typically does hold between description 
and norm. To paraphrase Cameron 1995, most descriptive statements would be suspect if they did not 
correspond to a norm, tacit or explicit.¹

The reverse also holds, in that the validity of prescriptions tends to be assessed in terms of their 
correspondence to observed practices. The cases cited in introductory linguistics texts deal with the 
comparatively small number of discrepancies between puristic dictates and actual usage. In other words, if 
we inventory the rules given in a pedagogical grammar, prescriptive by its nature, we will see that the vast 
majority would not be at odds with a linguist’s description, at least for non-colloquial registers. The only 
exceptions to this matching would be prescriptive grammars which focus on native speakers’ purported 
mistakes or elementary-level grammars that by their nature omit intermediate- and advanced-level rules. 

The tendency of introductory linguistics classes to regard prescriptivism as unitary also prevents us from 
perceiving its links with descriptivism. The problem with lumping all prescription into a monolithic 
category is that we fail to distinguish between clearly detrimental aspects of it, such as language mavens’ 
attempts to suppress teaching Ebonics in American schools, and circumstances where it is indispensable, 
as in a second language class. A more nuanced view of prescriptivism would present it as a continuum, 
ranging from realistic instances of e.g., register discrimination to purism, which aims at condemning all 
foreignisms and any other non-standard usage. This is a distinction which Thomas 1991 focuses on, 
although he unfortunately chooses the term purism as the all-embracing synonym for prescription. In my 
view, a language teacher who alerts her class to differences between formal and colloquial usages is being 
prescriptive but not purist, i.e., purism is best viewed as an extreme sub-type of prescriptivism. Our look at 
an on-line language forum, WordReference, will show that non-linguist native French speakers typically
espouse a moderate, usually non-puristic approach to prescriptivism. While giving norms their due 
recognition, the forum participants don’t hesitate to add a realistic acknowledgment of usages which often 
conflict with the norm. 

There is evidence that stylistic judgments, by their very nature prescriptive, exist throughout the world’s 
language communities, literate or not, as Bloomfield 1927 showed for Menomini. Every human’s linguistic 
intuition would thus appear to contain an inherently normative side. One area of study outside the 
traditional core of linguistics but clearly linked to our discipline is conversational competence. Ninio & 
Snow 1996 discuss children’s deficiencies in managing conversation. Their comparison of child and adult 
competencies in this area recalls a prescriptive manual, while they could also claim they are simply giving a 
description of such competencies. Their introductory sketch of adult conversational abilities seems highly 
idealized (e.g., minimize interruptions and overlaps, one must speak at some points and avoid speaking at 


¹ Cited by Cameron 1995:7, Milroy 1992 contends that any descriptive statement ultimately has a prescriptive dimension because
it must correspond to the implicit norms prevailing within a language community.




others, etc.). Their sketch undoubtedly lends itself to a formality-level account, i.e., only in dramatic works or 
serious public events like a round table discussion or political debate are speakers expected to adhere 
rigorously to the norms, which arguably integrate both social and linguistic know-how. In an informal 
setting, interruptions, silences, and sudden topic shifts are inevitable and don’t necessarily manifest 
incompetence, any more than using ain’t would in a working-class bar setting. The dividing line between a
descriptive account of conversational competencies and an evaluation of them, necessarily reflecting 
prescriptive assumptions, is thus far from obvious. Parallel examples are not hard to find in phonetics, syntax, 
and discourse, where the neatly delineated principles of formal styles give way to the variation, parataxis and 
inconsistencies of the colloquial. 


2.2. THE VALUE OF SOME PRESCRIPTIVISM. Even though much amateur judgment of language is
ill-informed and/or elitist, some prescriptive initiatives serve legitimate social ends, such as attempting to
resolve social inequalities, as seen in feminists’ efforts to bring about more inclusive language and the
movement to render legal and other specialized texts more accessible to the public.²


2.2.1. VERBAL HYGIENE. Verbal hygiene is the term proposed by Cameron 1995 for the global human 
tendency to evaluate language. Verbal hygiene activities run the gamut from rants on the use of apostrophes 
and split infinitives to debates on linguistic gender-inclusiveness to making technical documents more
accessible to the layperson. As a linguist, Cameron is the first to agree that some verbal hygiene pursuits are 
trivial or simply wrong-headed. She contends that virtually all non-linguists assume that language is by its 
nature subject to normative constraints, a position shared at least tacitly by many, if not most linguists. 
Hence, we do our discipline no service by shucking off challenges from purists as irrelevant to linguistic 
science. A case in point discussed at length by Cameron was the debate over re-introducing grammar in 
UK schools during the 1990s. This initiative came mostly from Thatcherite moral conservatives as part of an
attempt to instill “discipline” in the schools and maintain a sacred repository of grammatical wisdom. The
reaction from linguists was typically “this is not our concern.” As a result, the conservatives won the public
relations battle against the linguists, portraying the latter as apologists for students’ plummeting grammar
prowess. For Cameron, linguists missed a chance to instigate and direct a serious public debate on the
nature of grammatical rules. I bring up this issue to show how prescriptive norms drive public perception
of language, and to argue that when linguists ignore or trivialize norms, we cut ourselves off from exploring
a phenomenon that at times shapes human language.


2.2.2. FIXING ENGLISH. As confirmation that prescriptivism remains a topic of debate in linguistics, 
Curzan (2014: 66) looks into issues both longstanding (e.g., how to define a word) and recent (e.g., Word's 
grammar checker) that point up prescriptivism as a legitimate sociolinguistic trait and a meta-discourse on 
language. Curzan declares that it is time we reflected on the power of MS Word's grammar checker, which 
has acquired influence far surpassing any individual English teacher's, thanks to the market dominance of 
Word. The checker’s lack of any author’s name gives it the status of a supposedly neutral authority but many 
of its purported corrections are stylistically controversial among respected authors, e.g., the flagging of ‘and’ 
at the beginning of a sentence. Recalling Cameron’s position on verbal hygiene, Curzan concedes the utility 
of this prescriptive device, but she insists that it should be considered fair game for debate among both 
linguists and professional writers. 


² See Cameron 1995: 28, 29, 72 for discussion of the Plain English Campaign.




3. A LOOK AT NON-LINGUISTS’ META-LINGUISTIC COMMENTS ON FRENCH.


3.1. METHODOLOGY. My plan is to determine the relative frequency of prescriptive and descriptive 
comments in WordReference, a site devoted to answering lay people’s queries about language. With one 
exception taken from a bilingual forum for French learners, I have restricted my study to the most advanced, 
all-in-French forum. Lower-level forums understandably tend toward the prescriptive because the queries 
are usually from learners not yet ready for an in-depth discussion of variation. The advanced forum deals 
mainly with lexical distinctions and grammatical usage. The grammar threads divide into nuances which 
near-native speakers are typically unsure about, and phenomena where there is a clash between everyday 
usage and the prescriptive norm, of concern to both native speakers and advanced non-native learners. 

As for the background of WordReference moderators and senior contributors, I surmise that they are 
not linguists because first, they never self-identify as such and secondly, I have never encountered in their 
comments terminology from any specifically linguistic theory. They do, however, have in-depth 
metalinguistic knowledge of French morpho-syntax and lexicon, both normative and colloquial; this
knowledge is expressed either in layman’s language or in traditional grammar terms. They may well be
French teachers, but given the French passion for language matters, they are also likely to be non-professional 
language enthusiasts. I examine only posts which address variation and are from French native speakers, 
both European and Canadian. 


3.2. A SAMPLING OF THE DATA.


3.2.1. http://forum.wordreference.com/showthread.php?t=816051. In the first thread, a native
speaker forum member asks whether the present or imperfect subjunctive is correct in a clause introduced
by the conjunction après que ‘after’. The moderator replies that neither is right because this conjunction
requires the indicative. Another participant adds that such a “mistake” is frequent throughout France. A 
senior member responds that since French is a living language, undergoing constant change, it is important 
to recognize that the present subjunctive in oral contexts is well-attested. A highly respected usage manual, 
Grevisse 2011, starts by giving the rule to use the indicative but then provides a wealth of examples from 
famous authors where the subjunctive crops up. The inference is clearly that in such eminent company, the 
subjunctive with après que could hardly be considered a mistake. Another senior member retorts that the
subjunctive is found in the stream of consciousness of a character in Sartre’s novel La Nausée, so the 
example does not really pertain to Sartre’s own usage. The senior member who had brought up Le bon Usage 
claims that even if Sartre would avoid the non-standard construction in his own speech and writing, its use by 
one of his characters surely reflects something going on in Sartre’s sub-conscious. 

This thread is of interest for several reasons. First, we see the longstanding clash between the 
prescriptive norm (après que plus indicative) and everyday usage (après que plus subjunctive). Arguments
on many other issues reflecting the norm/usage split go back to at least the seventeenth century, when the
grammarian Vaugelas attempted to record how the aristocracy actually spoke, a project strongly opposed
by the newly established and thoroughly purist Académie française. Such debates are a cornerstone of
French culture, as seen in newspaper language chronicles, and far more widespread than in English-speaking
countries. 

Secondly, these controversies could be seen as an ongoing sign that descriptivism is alive and well 
outside linguistics. Contrary to the view often taught to beginning linguistics students, non-linguist language 
experts are not necessarily purists. This WordReference thread as well as most of the other 9 perused for this 
paper give voice to both positions, with the descriptive stance dominating. 




Thirdly, there is the final comment that Sartre’s subconscious must come into play to explain the non- 
standard usage in his character’s speech. The contributor probably lacks familiarity with linguistics, as seen 
in his simplistic allusion to the sub-conscious. French pragmaticians and enunciation scholars (see 
especially Anscombre/Ducrot 1983, Ducrot 1984 and Anscombre 2009) would account for this discrepancy 
as an instance of polyphony, where the character’s linguistic knowledge comes across as distinct from the 
author’s. It is likely that Sartre consciously chose the subjunctive for his character, who is neither a language 
specialist nor a literary scholar bound to the prescriptive norm, but a Francophone of average education who
has no reason to speak ultra-correctly. 


3.2.2. http://forum.wordreference.com/showthread.php?t=112147. Another thread on the same issue starts
with an Anglophone requesting advice on the appropriate mood. Some members respond that rule-abiding
structures with the future perfect like après qu’il sera parti ‘after he will have left’ may shock the ear, while
another member counters that media people are increasingly going “back” to the indicative. She concludes
that the semantic difference between avant que ‘before’ and après que ‘after’ justifies their requiring different
moods. On the usage side, we have the argument of naturalness, i.e., the non-normative option with the 
subjunctive is said to sound better, while a prescriptivist invokes semantic contrast: the subjunctive and the 
indicative are associated with semantically distinct triggers, hence communicative accuracy requires us to 
maintain the distinction in mood. Again we see the tension between everyday intuition and the prescriptive 
norm, with the non-linguists at WordReference to be found on both sides of the debate. 


3.2.3. http://forum.wordreference.com/showthread.php?t=22452501. The third example brings in register.
This is a discussion about the compound past (‘passé composé’) vs the double compound past (‘passé
surcomposé’) in C’est seulement après qu’une rafale de vent a chassé les effluves qu’on a repris nos esprits ‘It
was only after a gust of wind blew away the odours that we came round’. Some contributors insist on the
traditional requirement for the double compound tense because the first verb is anterior to the second: C’est
seulement après qu’une rafale de vent a eu chassé ... ‘...had blown away...’. Others insist that just the single
compound tense in such remoter past contexts is frequent and correct, the double compound alternative 
belonging to a higher register. Several arguments ensue about this limitation, with 2 of 5 members arguing 
that the double compound belongs to mainstream spoken French and is heard throughout France. The 
double compound may well reflect the official norm, but the majority of senior members see nothing 
untoward in using just the compound past instead. 


3.2.4. http://forum.wordreference.com/showthread.php?t=1084547. Example 4 deals with occurrences of
subjunctive vs indicative after quoique and bien que ‘although’. The thread is the only one retained which
is not from the French-only group. One bilingual respondent resorts to semantics: if the verb states a fact, it
is in the indicative in the concessive clause: quoique j’ai trouvé que... ‘although I found that...’, but if there
is uncertainty, it’s in the subjunctive: Quoiqu’il fasse le martyre... ‘Even though he may be playing the
martyr...’. He is contradicted by another native speaker who claims that “In contemporary French, only
the indicative mood is correct after quoique.”

Three further exchanges result in uncertainty about whether the indicative should ever be allowed after 
these concessives, for which the prescriptive norm mandates the subjunctive. The moderator provides the 


³ In support of this claim, my own Google search with this conjunction and trouver produced over 500,000 hits for the indicative
vs 7 for the subjunctive, the main factor favouring the indicative being the first-person subject. In other words, asserting that I or
we found something makes it more likely to be a fact than if the subject is other than first person. Similarly, the subjunctive
Quoiqu’il fasse beau ‘Even though the weather may be nice’ produced 90,000 hits vs over 3,000,000 for the indicative Quoiqu’il fait
beau.




following summation: most frequently, a grammatical servitude automatically triggers the subjunctive, and 
this is in accordance with the Académie's position. However, since the subjunctive can't distinguish between 
present and future, or certainty and contingency, indicative forms do arise, such as the future and the 
conditional. 

A similar variation pertains to the past/present subjunctive contrast: modern French has dropped the
imperfect and pluperfect subjunctive tenses, except in formal or archaic registers. The compound past
subjunctive is the most usual means of indicating past time but it now competes with the indicative
compound past, as in Quoiqu’ils ont été... ‘Although they were/have been...’ and Quoiqu’ils aient été...
‘Although they were/have been/may have been...’.⁴

For the mood contrast in the above examples, the indicative had roughly 3.6 times as many hits as the
subjunctive: quoiqu’ils aient été with 875,000 vs quoiqu’ils ont été with 3,180,000. These usage findings show
how far the selection of mood is from being a servitude grammaticale. They confirm the moderator’s 
position that the prescriptive norm gives but a partial, simplistic, picture. 
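The proportions behind these hit counts can be checked with a short calculation. The sketch below is only an illustration: it uses the Google figures reported in this section and its footnotes, while the dictionary layout and the function names `indicative_ratio` and `indicative_share` are assumptions introduced here, not part of the original study.

```python
# Google hit counts as reported in the text and footnotes:
# each entry maps a pair of forms to (indicative hits, subjunctive hits).
HITS = {
    "quoiqu'ils ont été / aient été": (3_180_000, 875_000),
    "quoiqu'il sera peut-être / soit peut-être": (376_000, 1_370_000),
    "quoiqu'il serait / soit": (33_700_000, 26_100_000),
}


def indicative_ratio(indicative: int, subjunctive: int) -> float:
    """How many indicative hits there are per subjunctive hit."""
    return indicative / subjunctive


def indicative_share(indicative: int, subjunctive: int) -> float:
    """Proportion of all hits for the pair that are indicative."""
    return indicative / (indicative + subjunctive)


for label, (ind, sub) in HITS.items():
    ratio = indicative_ratio(ind, sub)
    share = indicative_share(ind, sub)
    print(f"{label}: ratio {ratio:.2f}, indicative share {share:.1%}")
```

Reporting both a ratio and a share makes it easier to see that the indicative dominates in two of the three pairs, while the future form with peut-être is the minority variant.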


3.2.5. http://forum.wordreference.com/showthread.php?t=2579353. The final example has a purist’s
rejection being refuted by data. An American asks if the negation marker pas can ever be separated from the
adverb encore ‘yet’. One European francophone respondent is categorical: the two must always be kept
together, as stipulated on a grammar site. Another begs to differ, citing numerous counter-examples from
literary sources spanning the first half of the 20th century. To the prescriptivist member, these data are out of
sync with current practice. The second francophone responds with current counter-examples, which the first 
still dismisses as marginal. The moderator diplomatically and realistically concludes that non-natives 
should keep pas and encore together because that is the most frequent configuration, but it’s certainly not 
a mistake to separate them. We see that it’s not just linguists who adopt a usage-based approach. 


3.3 WORDREFERENCE CONTRIBUTORS’ LANGUAGE ATTITUDES. The total number of threads 
examined for the purposes of attitude classification is 10. Where the total of yeses and nos doesn’t add up 
to 10, that means the criterion was either irrelevant or it evoked a mix of positive and negative results for at 
least one thread. Here are the criteria: 
1) Presence of Prescriptive Markers (must, should, etc.) Results: Y 9; N 1
9 out of 10 threads had at least one occurrence of these markers, which are reliable signals of at least some
degree of prescriptivism. This result is hardly surprising, since the typical WordReference post initiating a
thread, by the very nature of the Forum, solicits a recommendation.
2) Descriptive Accuracy, e.g., does the thread recognize variation? Results: Y 10; N 0
All 10 threads acknowledged variation in native speakers’ usage. I see this as a counterweight to the
previous criterion, i.e., as an indicator of the balance between prescriptivism and descriptivism.
3a) Does a Thread Attempt to Explain the Variation? Results: Y 8; N 2
8 out of 10 threads accompanied the acknowledgment with an attempt at explanation. This contrasts with
the prescriptive stereotype for which the rule itself suffices as an explanation.
3b) Does a Thread Involving Variation Cite Register or Other Sociolinguistic Factors? Results: Y 6; N 2
While a sociolinguistic reason is generally a desideratum for a linguist, the typical user of WordReference 


⁴ The Google results for these mood alternations confirm the high incidence of variation. The future indicative token quoiqu’il sera
peut-être ‘although he will perhaps be’ garnered 376,000 hits while the subjunctive quoiqu’il soit peut-être gave 1,370,000 hits. The
conditional quoiqu’il serait produced 33,700,000 hits vs 26,100,000 for the subjunctive quoiqu’il soit.




is more interested in usage and norm as guides for correctness. It is thus noteworthy that 6 threads did bring
in register factors, typically the speech/writing divide. The other 2 offered a semantic explanation.⁵
3c) Does a Thread Favour One Variant Only, on Puristic Grounds Like Tradition or Standardization? Results: Y 2; N 7
Again, with 7 of 10 threads avoiding judgment, the dominant treatment was to “tell it like it is” for each
variant, i.e., the standard one was presented as the normative preference, while the non-standard variant was
the one favoured by usage.
4a) Does the Thread Acknowledge a Breach Between Usage and Norm? Results: Y 7; N 0
Of the 7 cases where this question was relevant, all acknowledged the breach.
4b) Did Acknowledgment of the Breach Occur Without Condemnation of the Colloquial Variant? Results: Y 5; N 2
Of the 7 relevant cases, 5 avoided such condemnation. The 2 other relevant threads dismissed the
counter-normative usage as a mistake.
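The tallies for the seven criteria can be gathered into one small table. The sketch below is a hypothetical illustration rather than the procedure actually used for this study: the yes/no counts are copied from the results above, while the names `RESULTS` and `summarize` and the derived "unclassified" column (threads where a criterion was irrelevant or gave mixed results) are assumptions introduced here.

```python
TOTAL_THREADS = 10

# (yes, no) counts per criterion, copied from the results reported above.
RESULTS = {
    "1  prescriptive markers present": (9, 1),
    "2  variation recognized": (10, 0),
    "3a variation explained": (8, 2),
    "3b register/sociolinguistic factors cited": (6, 2),
    "3c one variant favoured on puristic grounds": (2, 7),
    "4a usage/norm breach acknowledged": (7, 0),
    "4b breach acknowledged without condemnation": (5, 2),
}


def summarize(yes: int, no: int, total: int = TOTAL_THREADS) -> dict:
    """Derive the classified count, the remainder, and the share of yeses."""
    classified = yes + no
    return {
        "classified": classified,
        # Threads where the criterion was irrelevant or gave mixed results.
        "unclassified": total - classified,
        "yes_share": yes / classified,
    }


for criterion, (yes, no) in RESULTS.items():
    row = summarize(yes, no)
    print(
        f"{criterion}: Y {yes}, N {no}, "
        f"unclassified {row['unclassified']}, yes share {row['yes_share']:.0%}"
    )
```

Computing the share of yeses over classified threads only, rather than over all 10, keeps criteria such as 4a and 4b comparable with those that applied to every thread.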


4. SUMMARY AND CONCLUSION. Over the last two decades, much linguistic thinking has already 
called into question the binary “prescription is bad vs. description is good.” In the nineties, several
contributors to Linguist List 1997 had a lively debate about how linguists should regard the prescriptive. 
While no one questioned the appropriateness of an exclusively descriptive approach when an account of 
linguistic structure is the objective, several scholars noted that some linguists do their work in areas 
involving at least some prescription, such as dictionary-making. One linguist, Jacques Guy, after observing 
the huge descriptive component of Grevisse’s landmark Le bon Usage, even wondered if prescriptivism is 
not a straw man. 

This paper shares Cameron’s view that linguists should reflect more on the nature of prescriptivism and
its multifarious repercussions, ranging from head-in-the-sand simplistic or puristic stances to serious attempts to
address language-based social shortcomings and communication barriers. My main objective has been to
question the parallel simplification that descriptivism is the prerogative of linguists while lay writers on
language favour prescriptivism. A close examination of threads mainly in the WordReference French Only
forum revealed, on the part of the moderators, a consistent balance between an acknowledgment of
prescriptive norms and a respect for usage. This balance is all the more noteworthy, given the centuries-old
purist tendency of language commentators in the French-speaking world. Bakström 2006 found a similar
split in attitudes toward English grammar among subjects in Swedish schools and universities, with the
same preference for descriptivism.

In sum, what this preliminary study strongly suggests is a coexistence of prescription and 
acknowledgment of non-standard variation, at least among French language experts.° 


> #4 mentioned Sartre’s subconscious, #5 and #8 mentioned no factors, #7 had a contributor claiming that est-ce que vous pensez
‘do you think’ triggers the indicative (Google results refute this).

° The total of 10 threads is obviously too small for statistically valid inferences and will be increased with ongoing data perusal, with
greater focus on lexical variation such as vigesimal vs decimal numbering. I also plan to go beyond binary criteria by ferreting out
correlations between prescriptive/descriptive bias and the nature of the grammatical or lexical choice. For example, an initial hunch
is that because productive morpho-syntactic phenomena like the subjunctive vs indicative contrast have received greater attention
from guardians of the language, there may be a lingering prescriptive bias toward them. On the other hand,
handling of lexical questions may tend more to the descriptive because, since they outnumber morpho-syntactic phenomena, each
particular lexeme is less likely to have come under purists’ scrutiny.




REFERENCES 


ANSCOMBRE, JEAN-CLAUDE. 2009. La comédie de la polyphonie et ses personnages. Langue française
164:11-31.

---, & OSWALD DUCROT. 1983. L’argumentation dans la langue. Brussels: Mardaga.

BAKSTROM, ANN-MARIE. 2006. Prescriptivism and descriptivism: A study on attitudes towards
language. Bachelor's Thesis. Lulea University of Technology (Sweden).

BLOOMFIELD, LEONARD. 1927. Literate and illiterate speech. American Speech 2:432-38.

CAMERON, DEBORAH. 1995. Verbal hygiene. London/New York: Routledge.

CURZAN, ANNE. 2014. Fixing English. Cambridge: Cambridge University Press.

DUCROT, OSWALD. 1984. Le dire et le dit. Paris: Les éditions de minuit.

GREVISSE, MAURICE. 2011. Le bon usage. 15th edition by André Goosse. Paris: De Boeck-Duculot.

LINGUIST LIST 1997. Vol-8, 1764-1774. http://www.linguistlist.org/.

MILROY, JAMES. 1992. Language variation and change. Oxford: Blackwell.

NINIO, ANAT, & CATHERINE E. SNOW. 1996. Children as conversationalists. In Pragmatic development:
Essays in developmental science, 143-91. Ed. by A. Ninio & C. E. Snow. Boulder, Colorado: Westview
Press.

THOMAS, GEORGE. 1991. Linguistic purism. London/New York: Longman.


This article was first published at lacus.weebly.com. 


LACUS Forum 41:2.23-36 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 





BUILDING MULTILINGUAL COMPETENCE AND THE 
“AWAKENING TO LANGUAGES” APPROACH: 
ONLINE AUTOMATIC TRANSLATION TOOLS IN 
MULTILINGUAL TEACHING MATERIAL DESIGN 


EFTYCHIA DAMASKOU 
University of Thessaly 


Abstract: This paper asks whether online machine translators are effective in the design of multilingual 
teaching material through a comparative study of two pieces of research. The investigation tool in both cases 
was a teaching unit of 6 activities working on several foreign languages the students were not familiar with. The 
first one involved 115 pupils who were invited to do parallel work on the same linguistic content in Albanian, 
Swedish, German, Turkish and Romanian. The second one involved 117 pupils who had to work in parallel 
on four foreign languages they had never been taught: Hungarian, Dutch, Portuguese and Swedish. The 
linguistic content was exclusively retrieved from three online translators and confirmed by native speakers. 
The first piece of research aimed at raising awareness of multilingualism. The second one involved the 
building of plurilingual competence in foreign language class. The data were collected through a qualitative 
analysis of the pupils’ worksheets. Our conclusions concern limits and potentials of machine translators in 
the design of multilingual teaching material. 


Keywords: online machine translation, multilingualism, plurilingual competence, multilingual teaching 
material 


Languages: Albanian, Dutch, German, Hungarian, Portuguese, Romanian, Turkish, Swedish 


THE EUROPEAN UNION counts 24 official languages, and a considerable number of people speak more 
than two languages in their everyday life (Cenoz et al. 1998). In 1992, the Maastricht treaty established the 
free movement of persons, goods, services and capital (Stanicic 2003). In addition, it seems that the 
significant internal and external flow of immigrants (Asia, Africa) has contributed to the European 
linguistic and cultural mosaic. In education, several projects have aimed to enrich students’ repertoire of
languages in order to meet the needs of modern European society, in particular by raising student awareness
of the languages that members of any given class speak or are interested in (Filder 2006). In this context,
European language policy during the last 15 years has leaned on plurilingual education based on ICT. This
is mainly carried out through the adoption of pluralistic approaches which aim at building plurilingual
competence (Candelier 2007). The question here is whether online machine translation helps in developing
teaching material.


1. RESEARCH RATIONALE. In our research the term plurilingual competence refers to the ability to use
languages for communication purposes and to take part in intercultural interaction, where a person has
proficiency, of varying degrees, in several languages and experience of several cultures (Council of Europe
2001; Coste et al. 2007). In order to integrate multilingualism in a precise methodological framework,
Candelier (2007) mentions the pluralistic approaches, defined as teaching approaches based on
activities that involve several linguistic and cultural varieties at the same time (Candelier 2003). The
awakening to languages (hereafter ATL) approach involves learning activities concerned with
languages which the school generally does not intend to teach (Candelier 2003). According to this
approach, the proposed activities favor passing from one language to another through the use of prior
knowledge. Working in the ATL framework is about emphasizing linguistic relationships that explain
linguistic affinities and differences.

As regards the use of online machine translators (hereafter MTs) in language class, the relevant scientific
literature testifies that teachers often use MTs in order to illustrate the differences between languages
(Anderson 1995; McCarthy 2004). MTs can be employed as a powerful focal point in L2 learning by
developing a set of requirements and procedures called a learning algorithm (Anderson 1995). Rischoff
(1993) refers to the application of computational technologies as having enhanced language learning,
implying that these resources should be designed “to facilitate language learning and language acquisition
processes”. In general, wide availability, immediacy, and a large range of languages appear to be among the
positive features of MTs. They can also be much cheaper than human translators (Vitek 2000), and their use
can prove vital in the mass production of professional documents (Harris 2010). MTs are effective with
lexical translation and work reasonably well with repetitive, simply structured texts such as weather reports
or technical manuals (Niño 2009; Napier 2002). However, the output of free online translators includes
many grammatical and orthographic errors, letter omissions, and unnecessary words, and it usually keeps
the structures of the source language. It also contains inaccuracies in connectives and in co-reference
between sentences, as well as punctuation and capitalization errors. Further, online MT tools are unable to
account for cultural references and sometimes produce unnatural writing (Niño 2009). In fact, as Melby
(1995) explains, computers “are very likely to produce atrocious results [unless the text treats] a very narrow
topic in a rather dry and monotonous style”. In other words, MTs require an enormous amount of human
intervention to maintain quality control. As Budiansky (1998, cited by Harris 2010) notes, “Language is full
of ambiguity and multiple meanings that a correct reading of syntax goes only a short way toward sorting
out [because] computers don’t have any common sense.”


2. OUR RESEARCH. The main question of this paper is how online MT tools could be used in developing
multilingual teaching material that raises awareness of linguistic diversity and develops plurilingual
competence. In the first case, we chose to use online MTs in order to have free and immediate access to
valid lexical content in a large range of languages (especially the less common ones), so as to
facilitate our cross-linguistic approach. In the second case, the online MTs were expected to provide free and
immediate access to a large number of languages, in order to study the pragmatic, grammatical and lexical
aspects of European linguistic diversity.


2.1. OUR RESEARCH METHODOLOGY. Both pieces of research were based on teaching units conceived
according to the ATL framework. In both, the teaching material used consisted of a series of activities that
invited pupils to work through a cross-linguistic approach on several unknown foreign languages. In both,
we were careful to diversify the type of activity (matching activities, multiple choice, brainstorming, free
production). At the same time, we focused on the involvement of cognitive and linguistic skills such as
metalinguistic awareness, knowledge transfer, and cross-linguistic influence, through exploration, analysis,
reflection and comparison tasks. The data collection tools consisted of the pupils’ answer sheets as well as
the remarks in a personal logbook concerning pupil participation rates, correct answers, and pupils’
inquiries and thoughts.

The first piece of research took place in 2012 and our sample included 115 5th and 6th graders. Our main
investigation tool was a teaching unit consisting of a series of six activities. The unit had a four-part structure
in the following order: an awakening part, a conceptualization part, a systematization part, and, finally, an
application-evaluation part. The activities invited pupils to work in parallel on the same linguistic content
(vocabulary, grammar and syntax) in 5 languages (Albanian, Swedish, German, Turkish and Romanian) they
had never been taught. The language selection criteria were the following: Albanian was chosen because it
is the language of the majority of immigrants in Greece, so there are many children who speak this language
at home; Romanian was chosen because it is the language of a neighboring Balkan country, a very good
partner in many European projects. This language is also related to French, which is taught in Greek schools
as a second foreign language. The selection criteria for German included the goal of learning its characteristic
accent. The main reasons for choosing Turkish were its geographical proximity and its lexical
similarities to Greek (Dimela 2013). Swedish was chosen for its similarity with German (a very common
foreign language), but also because, being a Nordic European language, it is not very popular in Greece.

Our second piece of research took place in 2013 and involved 117 5th and 6th graders. The unit theme
was: “Invite a foreign friend to my house”. The proposed activities asked pupils to study, through a parallel
approach, four foreign languages unknown to them: Hungarian, Dutch, Portuguese and Swedish. We chose
Dutch because it is a language related to English and German, and Swedish because it is not very popular in
Greece despite the masses of Scandinavian tourists visiting Greece during summer. We also chose
Portuguese because it is related to French (the 2nd foreign language learnt by the pupils at school). Finally,
Hungarian was chosen because it doesn’t belong to the Indo-European language family and ATL requires
the involvement of languages of any language family (Candelier 2003).

All linguistic content was retrieved from four online automatic translators using the interlingua-based
system. The criteria for the use of these translation tools include free accessibility, immediacy, easy output
processing and the variety of possible combinations among languages. We
opted to use more than one translator, in order to avoid targeting a specific online translator, but also to
extract an objective and valid output. The correctness of the linguistic material retrieved from the online
translators was verified through a search engine in the first piece of research and by native speakers of the
respective languages in the second piece of research.

Table 1 presents the activities given to our sample in the first research. 


Table 1. The instructions given in the 1st piece of research.

Activity | Instructions
1st act. | [...] languages you heard: German, Swedish, Turkish or Romanian?
2nd act. | Listen again to the previous recording in a different row and match them with the correct written form, choosing from the following word groups.
3rd act. | Guess the meaning of the following words and try to justify your answer.
4th act. | Guess the meaning of the following phrases and identify in which language they are, from the previously studied languages.
5th act. | Match the texts of the left column to those in the same language appearing in the right column, and then explain your choices.
6th act. | With your school music band « Mikri Tenori » you are organizing concerts in several countries around Europe and East, such as Albania, Romania, Sweden, Turkey, and Germany. You and your classmates decided to create on your own the concert posters for each of the countries in their language.

Table 2 presents the instructions given for the activities proposed in the second piece of research. 


Table 2. The instructions given in the 2nd piece of research.

Activity | Instructions
1st act. | You are going to listen to four groups of words. Guess in which language they are: Portuguese, Dutch, Swedish or Hungarian?
2nd act. | You are going to listen to the same groups of words. Try to match what you hear with one of the following groups.
3rd act. | Cross out the word that does not match with the others.
4th act. | Observe the following arrows. Towards which direction should we turn in order to go to school, to the park, to the museum, to the police station, and to the hospital?
5th act. | Observe the following sentences: a) underline the words that you already know or those that sound familiar, b) guess in which language they are (Portuguese, Hungarian, Dutch or Swedish), and c) try to find their meaning.
6th act. | A Portuguese/Hungarian/Swedish/Dutch friend of yours would like to visit you. You are sending them an email in their language in order to explain how they could get to your home. Fill in the following email form by putting the words that you are given in the following order: I live at 30 Venizelou Street. Take the bus number 40 and get off at "Venizelou" stop. My house is next to the post office. Go straight ahead and turn right.





2.2. OUR RESEARCH RESULTS. In this section, we compare the data retrieved from the online MTs in 
both pieces of research. Figure 1 shows the percentage of errors detected in words retrieved in the first piece 
of research. These words were used in the first three activities. 




[Bar chart: percentage of errors per translator (1st MT, 2nd MT, 3rd MT, 4th MT); legible data labels: 17.50%, 14%]

Figure 1: Errors detected in word output in the 1st piece of research


Figure 2 presents the percentage of errors detected by the native speakers in the sentences retrieved from 
the online MTs. 


[Bar chart: percentage of errors per translator (1st MT, 2nd MT, 3rd MT, 4th MT); legible data labels: 35%, 40%, 40%]

Figure 2: Errors detected in phrase output in the first piece of research


The data received concerning the words needed for our teaching unit are presented in Table 3. We 
systematically avoid referring to the MT brands in order to avoid any possible legal problems with their 
administrators. For this reason, we replaced the relevant sites by the abbreviation MT and a number. 


Table 3. The word output received in the 1st piece of research.

Theater
MT1: Theater (DE), teater (SE), Tiyatro (TR), Theatre (RO).
MT2: Theater (DE), Teater (SE), Tiyatro (TR), Theatre (RO).
MT3: Theater (DE), teater (SE), tiyatro (TR), teatru (RO), teatri (AL).
MT4: Theater (DE), teater (SE), Tiyatro (TR), Theatre (RO).
Native speaker: Theater (DE), teater (SE), tiyatro (TR), teatrul (RO), teatri (AL).

Stadium
MT1: Stadium (DE), stadion (SE), stadium (TR), stadium (RO).
MT2: Stadion (DE), stadium (SE), stadyum (TR), stadium (RO).
MT3: Stadion (DE), stadion (SE), stadyum (TR), stadion (RO), stadiumi (AL).
MT4: Stadion (DE), stadium (SE), stadyum (TR), stadion (RO).
Native speaker: Stadium (DE), stadium (SE), stadyum (TR), stadion (RO), stadiumi (AL).

Piano
MT1: Klavier (DE), piano (SE), piyano (TR), pian (RO).
MT2: Piano (DE), piano (SE), piyano (TR), pian (RO).
MT3: Klavier (DE), piano (SE), piyano (TR), pian (RO), piano (AL).
MT4: Piano (DE), piano (SE), piyano (TR), pian (RO).
Native speaker: Klavier (DE), piano (SE), piyano (TR), pian (RO), piano (AL).

Bass
MT1: Baß (DE), bas (SE), bas (TR), bas (RO).
MT2: Bass (DE), bas (SE), bas (TR), bas (RO).
MT3: Bass (DE), bas (SE), bas (TR), bass (RO), bas (AL).
MT4: Bass (DE), bas (SE), bas (TR), bas (RO).
Native speaker: Bass (DE), bas (SE), bas (TR), bass (RO), bas (AL).

Guitar
MT1: Guitarre (DE), gitarr (SE), gitar (TR), chitara (RO).
MT2: Gitarre (DE), gitarr (SE), gitar (TR), chitara (RO).
MT3: Gitarre (DE), gitarr (SE), gitar (TR), chitara (RO), kitarë (AL).
MT4: Gitarre (DE), gitarr (SE), gitar (TR), chitara (RO).
Native speaker: Gitarre (DE), gitarr (SE), gitar (TR), chitara (RO), kitarë (AL).

May
MT1: Mai (DE), Maj (SE), Mayis (TR), mai (RO).
MT2: Mai (DE), maj (SE), Mayis (TR), mai (RO).
MT3: Mai (DE), maj (SE), Mayis (TR), mai (RO), Maj (AL).
MT4: Mai (DE), maj (SE), Mayis (TR), mai (RO).
Native speaker: Mai (DE), maj (SE), Mayis (TR), Mai (RO), Maj (AL).

June
MT1: Juni (DE), Juni (SE), Haziran (TR), Iunie (RO).
MT2: Juni (DE), juni (SE), Haziran (TR), Iunie (RO).
MT3: Juni (DE), juni (SE), Haziran (TR), iunie (RO), Qershor (AL).
MT4: Juni (DE), juni (SE), Haziran (TR), Iunie (RO).
Native speaker: Juni (DE), juni (SE), Haziran (TR), Iunie (RO), Qershor (AL).

Friday
MT1: Freitag (DE), Fredag (SE), Cuma (TR), vineri (RO).
MT2: Freitag (DE), fredag (SE), Cuma (TR), vineri (RO).
MT3: Freitag (DE), fredag (SE), Cuma (TR), vineri (RO), e premte (AL).
MT4: Freitag (DE), fredag (SE), Cuma (TR), vineri (RO).
Native speaker: Freitag (DE), fredag (SE), Cuma (TR), Vineri (RO), e premte (AL).

Saturday
MT1: Samstag (DE), lordag (SE), Cumartesi (TR), sambata (RO).
MT2: Samstag (DE), lordag (SE), Cumartesi (TR), sambata (RO).
MT3: Samstag (DE), lordag (SE), Cumartesi (TR), sambata (RO), e shtunë (AL).
MT4: Samstag (DE), lordag (SE), Cumartesi (TR), sambata (RO).
Native speaker: Samstag (DE), lordag (SE), Cumartesi (TR), sambata (RO), e shtunë (AL).

Sunday
MT1: Sonntag (DE), söndag (SE), Pazar günü (TR), duminica (RO).
MT2: Sonntag (DE), söndag (SE), Pazar günü (TR), duminica (RO).
MT3: Sonntag (DE), söndag (SE), Pazar günü (TR), duminica (RO), e diel (AL).
MT4: Sonntag (DE), söndag (SE), Pazar günü (TR), duminica (RO).
Native speaker: Sonntag (DE), söndag (SE), Pazar günü (TR), Duminica (RO), e diel (AL).

Spectacles
MT1: Schauspiel (DE), anblickar (SE), gözlük (TR), ochelarii (RO).
MT2: Brillen (DE), glasögon (SE), gözlük (TR), ochelarii (RO).
MT3: Schauspiele (DE), glasögon (SE), gözlük (TR), ochelari (RO), syze (AL).
MT4: Brillen (DE), glasögon (SE), gözlük (TR), ochelarii (RO).
Native speaker: Schauspiele (DE), Underhallning (SE), gösteri (TR), spectaculul (RO), shfagjeve (AL).


Table 4 presents the data received by the online MTs and the natives concerning the required 
sentences and phrases in the first research. 


Table 4: The phrase output received in the 1st piece of research.

German
MT1: Heiliges Gallen Theater Berlin Montag, den 4. Marz Um 6 P.M. Liedwettbewerb fre
MT2: St. Gallen Theater Berlin Montag, den 4. Marz Bei 6 Uhr. Liedwettbewerb Kostenlose entrance
MT3: Sankt Gallen Theater 4. Montag Marz um 6 Uhr Song Contest Eintritt frei
MT4: St. Gallen Theater Berlin. Monday, March 4. Bei 6 Uhr. Song Contest. Freier Eintritt
Web search confirmation: Theater St. Gallen Berlin Montag 4. Marz, Um 18.00 Liedwettbewerd Eintritt frei

Swedish
MT1: Nobel teater Stockholm Fredag April 20th Pa 6 P.m. songstrid fritt hanrycka
MT2: Nobel Teater Stockholm Fredag 20th April Hos 6 pm songstrid fritt intrade
MT3: Nobel Theater Stockholm Fredag 20th april Vid 06:00. Song contest Fri entré
MT4: Nobel teater Stockholm. Fredag April 20th. Vid 6:00. Song contest. Fri entré
Web search confirmation: Teater Nobel Stockholm Fredag 20th April. KL. 18:00 songstrid Gratis entré

Turkish
MT1: Kemal Sinema Instanbul 25 Mayis Cuma Günü 6 Pm Sarki yarismasi Ucretsiz giris
MT2: Kemal Sinema Instanbul 25 Mayis Cuma Guniti Sarki yarismasi serbest giris/ alinmayan giris/ ücretsiz giris/ davetsiz giris izni 6 pm
MT3: Kemal Tiyatro Instanbul. Cuma 25 Mayis. 06:00 at. Sarki yarismasi. serbest giris
MT4: Kemal Sinema Instanbul. 25 Mayis Cuma günü. 6 Pm. Sarki yarismasi. Ucretsiz giris
Web search confirmation: Konser Tiyatro Kemal Istanbul 25 Mayis Cuma, Saat: 18:00 sarki yarismasi ücretsiz giris

Albanian
MT1: language unavailable
MT2: The website was temporarily experiencing technical difficulties.
MT3: Kombëtar Theater, Tirana E premte 21 prill Në orën 6 pasdite garë song hyrja falas
MT4: language unavailable
Web search confirmation: Teatri Kombëtar Tirana E premte 21 prill, në orën 18:00 konkurs song hyrja falas

Romanian
MT1: Teatrul national din Bucuresti Vineri 30 Martie La ora 6 pm Melodie concurs Intrarea libera
MT2: Teatrul national din Bucuresti Vineri 30 Martie La ora 6 pm Melodie concurs Intrarea libera
MT3: Teatrul national din Bucuresti Vineri 30 martie la 18 concurs de cantec intrare libera
MT4: Teatrul national din Bucuresti Vineri 30 Martie La ora 6 p.m. Melodie concurs Intrarea libera
Web search confirmation: Teatrul National (din) Bucuresti Vineri 30 martie ora 18.00 Melodie concurs, Intrarea libera


Figure 3 presents the percentage of error types detected in the sentences analyzed by the native
speakers in the second research.





[Bar chart titled “output analysis concerning phrases”; categories: Making no sense, Translation into English needed, Identical to native speakers, corrected by native speakers, accepted by native speakers; legible data labels: 100%, 87.50%]

Figure 3. Output analysis concerning the phrases in the 2nd piece of research.


Table 5 shows the types of errors detected in word analysis of the output retrieved from the online MTs
in the second research.


Table 5: The errors in word output in the 2nd piece of research

Type of error | Number of words
Spelling |







Tables 6 through 9 present the data received from the online MTs and the corrections proposed by the
native speakers concerning the sentences required in the last two activities given to pupils.


Table 6. The sentence output for Swedish in the 2nd piece of research.

Source sentence: I live in 30, Venizelou Street.
MT2  | JAG Venizeloustreet30
MT3  | Jag bor pa 30 Venizelos
N.S. | Jag bor på gatan Venizelou 30

Source sentence: Take the bus number 40 and get off at the stop “Venizelou”.
MT1  | Tabussanumrerar 40 och fir av pa postera “Venizelou”
     | Ta buss nummer 40 och stig av vid Venizelos
     | Ta buss nummer 40 och ga av vid stationen “Venizelou”
     | Ta buss 40 och ga av vid busshallsplatsen “Venizelou”

Source sentence: My house is next to the post office.
     | Mitt hus ar bredvid t-nolla postar
MT2  | Mitt hus ligger intill post


Table 7. The sentence output for Dutch in the 2nd piece of research.

Source sentence: I live in 30, Venizelou Street.
MT1  | Ik blijf in de weg Benizeloy 30
MT4  | I live in 30, Venizeloustret
N.S. | Ik woon in Venizelosstraat 30

Source sentence: Take the bus number 40 and get off at the stop “Venizelou”.
     | De busnoymero 40 van Paros en u daalt in de houding „Benizeloy”
MT2  | Neem bus nummer 40 te stoppen “Venizelou”
     | Neem bus 40 en stap uit bij “Venizelos”
     | Neem de buslijn 40 en stap uit bij station “Venizelou”
N.S. | Neem bus 40 en stap bij de halte “Venizelou”.

Source sentence: My house is next to the post office.
     | Mijn huis is naast to post

Source sentence: Keep on going straight ahead and turn right.
N.S. | Sla rechtsaf en ga rechtdoor


Table 8. The sentence output for Portuguese in the 2nd piece of research.

Source sentence: I live in 30, Venizelou Street.
     | Eu vivo na estrada Benizeloy 30.
     | Eu vivo na rua Venizelou 30
     | Eu vivo na rua Venizelos 30.
     | Eu vivo em 30h, rua Venizelou
N.S. | Moro na rua Venizelou 30.

Source sentence: Take the bus number 40 and get off at the stop “Venizelou”.
MT1  | O noymero 40 e você da barra-ônibus de Paros vai para baixo na atitude “Benizeloy”
     | Apanhe o autocarro numero 40 e para parar “Venizelos”
     | Apanhe o autocarro 40 e desca na “Venizelos”
     | Pegue o Onibus numero 40 e saia na estacao “Venizelou”
N.S. | Apanha o autocarro 40 e desce na paragem ‘Venizelou’.

Source sentence: My house is next to the post office.

Source sentence: Keep on going straight ahead and turn right.
N.S. | Vira a direita e segue em frente


Table 9. The sentence output for Hungarian in the 2nd piece of research.

Source sentence: I live in 30, Venizelou Street.
MT1  | En tovabbra is a kézuti Benizeloy 30
MT2  | En élek a Venizelou utca 30
     | Elek 30 Venizelosz,
     | A Venizelou utca 30-ban lakom

Source sentence: Take the bus number 40 and get off at the stop “Venizelou”.
N.S. | Szallj fel a 30-as buszra és szallj le a Venizelou megallonal

Source sentence: My house is next to the post office.
     | A ház mellett van a to post
     | A ház mellett van a posta
     | A ház mellett a posta
     | A ház mellett van a mailconnector
N.S. | A hazam a posta mellett van.

Source sentence: Keep on going straight ahead and turn right.
MT1  | Tovabb egyenesen és jobbraforgo
MT4  | Tovabb egyenesen és jobbravalé navigdlishoz


Figure 4 presents an error analysis for the posters created in the last activity of the first research. 


[Chart; legend: Posters without any error, Posters with several errors, Posters with many errors; vertical axis from 0 to 25]

Figure 4. Error analysis of the pupils’ productions in the last activity of the 1st piece of
research.







Finally, Table 10 presents an error analysis of the text produced in the last activity of the second 
research. 


Table 10. Error analysis of the pupils’ productions in the last activity of the 2nd piece of
research.

Totally correct | several mistakes | wrong order | many mistakes | incomplete | no answer


2.3. ANALYSIS. The control and analysis of the output retrieved from the MTs, for both the required
sentences and the single words, reveal a significant difference between word and sentence translation accuracy.
In particular, most of the sentences needed important corrections. As regards the words, the percentage of
errors detected is negligible. It mainly concerns lexical, grammatical, and spelling inaccuracies. The range
of errors in word translation consists of words transliterated from English (e.g., stadium in RO and TR),
spelling errors (e.g., “chitara” in RO instead of “chitară”), and meaning ambiguity (e.g., for ‘spectacles’ we got
Brillen in German instead of Schauspiele, or gözlük ‘sunglasses’ in Turkish).
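
The word-level comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses the German column of three rows of Table 3 (the keys "stadium", "piano" and "guitar" are our own labels), and it counts any form that differs from the native speaker's form as a mismatch, which is a simplification and not necessarily the criterion behind the percentages in Figure 1.

```python
# German (DE) forms copied from three rows of Table 3.
# The dictionary keys are hypothetical labels chosen for this sketch.
REFERENCE = {"stadium": "Stadium", "piano": "Klavier", "guitar": "Gitarre"}

MT_OUTPUT = {
    "MT1": {"stadium": "Stadium", "piano": "Klavier", "guitar": "Guitarre"},
    "MT2": {"stadium": "Stadion", "piano": "Piano", "guitar": "Gitarre"},
    "MT3": {"stadium": "Stadion", "piano": "Klavier", "guitar": "Gitarre"},
    "MT4": {"stadium": "Stadion", "piano": "Piano", "guitar": "Gitarre"},
}


def mismatch_rate(output, reference, ignore_case=True):
    """Share of reference words whose MT form differs from the reference form.

    Assumption of this sketch: any difference counts as a mismatch;
    capitalization is ignored unless ignore_case is False.
    """
    def norm(word):
        return word.casefold() if ignore_case else word

    misses = sum(
        1 for key, ref in reference.items()
        if norm(output.get(key, "")) != norm(ref)
    )
    return misses / len(reference)


rates = {mt: mismatch_rate(out, REFERENCE) for mt, out in MT_OUTPUT.items()}

for mt, rate in rates.items():
    print(f"{mt}: {rate:.0%} of the sample words differ from the native speaker's form")
```

With only three words the resulting rates say nothing about the translators themselves; the point is the procedure, which extends to the full word list and to the other languages by adding rows to the two dictionaries.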

As regards the sentences retrieved, most of the suggested answers diverge significantly. Take
Hungarian, for example: for the sentence ‘I live in 30, Venizelou Street’, we obtained four different options
(En tovdbbra is a kéztiti Benizeloy 30, En élek a Venizelou utca 30, Elek 30 Venizelosz, En pedig élek a
Venizelou utca 30). None of them was correct: the native speaker did not correct them but instead
proposed a fifth one since, according to him, there were too many mistakes. Additionally,
sentences with little risk of meaning ambiguity and a less complex structure were closer to the native
speakers’ suggestions. For instance, if we look at the translations retrieved for the sentences ‘Take the
bus number 40 and get off at the station Venizelou’ and ‘My house is next to the post office’, we can easily
see that the output for the first sentence (see Tables 8 and 9), which is slightly more complex than
the second one, diverges more than the output for the second sentence.
Apparently, the collocation “take the bus” and the phrasal verb “get off the bus” are quite “problematic” for
the online MTs’ databases. On the other hand, the second sentence has similar translations, quite
close to the native speaker’s suggestion. Therefore, a teacher who has no knowledge of
Hungarian or Portuguese runs a serious risk of obtaining invalid results if they rely only on online MTs. In
addition, most of the sentences retrieved in both pieces of research present an inappropriate syntactic
structure, as they keep the source language structure. There are even sentences which make no sense at all
(i.e., Za buss nummer40ochattavbryta Venizelou(SE), Debusnoymero 40 van Parosenudaaltinde houding 
,» Benizeloy (NL), Hetgingrechtdoorverderendraaitnet(NL), Entovabbraisa kézuti Benizeloy30(HU), Vegye 
le a busz szdma 40, ‘romok"(HU), O noymero 40 e vocé da barra-6nibus de Paros vai para baixo na atitude 
“Benizeloy (PT)). We should also note that less common languages such as Albanian are not included in
most of the online MTs.


3. CONCLUSION. Our main conclusion is that online MT tools are useful and
trustworthy only for word-for-word translation and not for the study or teaching of foreign language
syntax. Online MTs are a free and accessible tool which gives instant access to linguistic content in many
languages and is useful for the study of the morphology of languages through a cross-linguistic approach.
By extension, MTs assist in the awareness of linguistic diversity, similarities, and differences. The 
considerably high percentage of errors detected in sentence output, mainly in syntax and grammar, leads 
to the conclusion that online MTs are not trustworthy for building plurilingual competence. However, the 
pupils’ relatively high scores in both pieces of research reveal that online MTs are reliable within a logical 
teaching context. This permits an interlingual approach combined with native speaker resources, in order 
to obtain valid results. Besides, as Vitek (2000) notes, “since a machine does not understand concepts of 
accuracy or meaning and lacks agency, accurate translations may only be possible if the input range is 
extremely limited, with the input, translation process, and output subject to a strict and professional human
control”. 


4. ACKNOWLEDGMENTS. This research would not have been completed without the important 
contribution of Olivier, Maria, Agoston and Vitoria, who voluntarily agreed to correct the data retrieved 
from the online MT in their mother tongues. 


REFERENCES 


ANDERSON, D. 1995. Machine translation as a tool in second language learning. CALICO Journal 13(1):
68-97.

BUDIANSKY, S. 1998. Lost in translation. The Atlantic Online.

CANDELIER, M. 2003. Evlang - l’éveil aux langues à l’école primaire - Bilan d’une innovation
européenne. Bruxelles: De Boeck - Duculot.

---. 2007. Cadre de référence pour les approches plurielles des langues et des cultures. Strasbourg: Centre
Européen pour les Langues Vivantes / Conseil de l’Europe.

CENOZ, J., & F. GENESEE. 1998. Psycholinguistic perspectives on multilingualism and multilingual
education. In Beyond bilingualism: Multilingualism and multilingual education, ed. by J. Cenoz & F.
Genesee, 16-32. Clevedon, UK: Multilingual Matters.

COSTE, D., D. MOORE, & G. ZARATE. 2009. Vers un Cadre Européen Commun de référence pour
l’enseignement et l’apprentissage des langues vivantes: études préparatoires, Version révisée.
Strasbourg: Division des Politiques linguistiques.

COUNCIL OF EUROPE. 2001. Common European framework of reference for languages: Learning, teaching,
assessment. Cambridge, UK: Cambridge University Press.

DIMELA, E. 2013. Turkish adjectives as loanwords in the dialects of Asia Minor: A first approach. In
Patras working papers in linguistics, 3 (Morphology and language contact): 92-109.

FILDER, S. 2006. Awakening to languages in primary school. ELT Journal 60(4): 346-54.

HARRIS, H. 2010. Machine translations revisited: Issues and treatment protocol. The Language
Teacher 34(3), May/June 2010.

MCCARTHY, B. 2004. Does online machine translation spell the end of take-home translation
assignments? In CALL-EJ Online 6.1.

MELBY, A. K. 1995. Why can’t a computer translate more like a person? Fortune City.



NAPIER, M. 2002. The soldiers are in the coffee - An introduction to machine translation. Cultivate 
Interactive, 2. 

NIÑO, A. 2009. Machine translation in foreign language learning: Language learners’ and tutors’
perceptions of its advantages and disadvantages. ReCALL 21: 241-58.

RUSCHOFF, B. 1993. Language learning and information technology: State of the art. CALICO Journal
LO, O07: 

STANICIC, M. 2003. Free movement of people, goods, services, and capital in the context of the Balkan 
Stability Pact. Connections: The Quarterly Journal, vol. 2, no. 2:21-37. 

VITEK, S. V. 2000. Reflections of a human translator on machine translation; or, will MT become the Deus
Ex Machina rendering humans obsolete in an age when Deus Est Machina? Translation Journal 4(3).
www.accurapid.com/journal/13mt.htm.


This article was first published at lacus.weebly.com. 


LACUS Forum 41:2.37-46 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 





CLASSIFYING UNCLASSIFIED SPEECH ERRORS: 
AN EXAMINATION OF THE UNEXAMINED 


WILLIAM J. SULLIVAN                      SARAH TSIANG
Uniwersytet Marii Curie-Skłodowskiej     Eastern Kentucky University


Abstract: Previously unexamined examples from our corpus of speech or performance errors are examined 
and classified in an attempt to determine whether any new categories of errors can be found and to ensure 
that the categories so far discovered account for all types. Though a number of interesting and sometimes 
amusing combinations of errors were uncovered, the validity of previous findings was demonstrated overall. 


Key words: speech errors, performance, encoding, decoding, relational networks 


Languages: English, Polish 


DURING THE PAST FOUR YEARS we have been gathering, analyzing, and classifying speech or 
performance errors. Our corpus comprises several thousand examples of Polish and English utterances 
considered consistent with the speaker's linguistic system, but not what the speaker intended to say. So far, 
the data have been analyzed according to four main categories: timing errors (anticipation, perseveration, 
spoonerisms), tactic pattern errors (i.e., structural errors, broadly defined), unintended blends, and
substitutions (Sullivan 2011; Sullivan & Tsiang in press b, c; Tsiang & Sullivan in press). In addition, 
redundancies collected as anomalies were examined (Sullivan & Tsiang in press a). At the end, a substantial 
number of examples remained unclassified. The unclassified examples are categorized, described, and 
explained here. They are found to be consistent with our previous work on speech errors and supportive of 
the model in Figure 1 (following page). 


1. THE BACKGROUND. The descriptive model used by WJS during more than forty years of describing
major parts of several languages is summarized in the diagram in Figure 1, which is intended to represent 
the logic underlying the storage and processing of linguistic information. Figure 1 shows a linguistic system 
connecting the cognitive store or general knowledge base with the systems for the production and 
perception of speech sound. It is a relational network with five strata. Each stratum defines that stratum’s 
emes or contrastive elements and is centered around a generalized (not necessarily optimized) tactic 
pattern, where the structural relations between the emes are specified. Each tactic pattern has realizational 
relations to adjacent tactic patterns or to the cognitive store (in the case of semology) or the systems 
for the production and perception of sound (in the case of hypophonology). Encoding generally begins 
with inputs to semology from the cognitive store. These inputs are in the form of spreading activation that 
proceeds through the linguistic network from top to bottom. Soon after semological processing begins,
activation spreads through realizational relations to the syntax, where syntactic processing begins. Soon
after syntactic processing begins, activation spreads through realizational relations to the morphology, and 
so on through the system. At this point semological and syntactic processing may still be going on. Thus, 
we have loosely-yoked parallel processing by spreading activation. Each stratum may also take random rest 
periods during the processing. The linguistic system is highly elaborated, if logically simple in conception, 
so it was easy to observe errors in production. Given the resulting complexity of the whole system, it is
surprising that error-free output is the general rule. We have classified four (five, counting redundancies) 
error types, but with a substantial number of examples remaining, we wondered whether there are any 
additional types of errors. So we turned to the unexamined examples. 
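
The loosely-yoked parallel processing described above can be illustrated with a small simulation. This is only a minimal sketch under stated assumptions, not the authors' model: the number of units to encode, the rest probability, the one-step lag between strata, and the function name simulate are all hypothetical choices made for illustration. Each stratum works only on what the stratum above has already passed down, and may rest at random.

```python
import random

# Strata named in the text, top to bottom.
STRATA = ["semology", "syntax", "morphology", "phonology", "hypophonology"]

def simulate(n_units=4, rest_prob=0.25, seed=1, max_steps=1000):
    """Toy model: return, per time step, how many units each stratum has processed.

    n_units, rest_prob, seed and max_steps are illustrative assumptions,
    not values taken from the article.
    """
    rng = random.Random(seed)
    done = [0] * len(STRATA)
    history = []
    for _ in range(max_steps):
        prev = done[:]  # snapshot: what each stratum had passed down before this step
        for i in range(len(STRATA)):
            # The top stratum draws on the cognitive store; every other stratum
            # can only work on what the stratum above had already realized.
            available = n_units if i == 0 else prev[i - 1]
            resting = rng.random() < rest_prob  # random rest period
            if not resting and done[i] < available:
                done[i] += 1
        history.append(done[:])
        if done[-1] == n_units:
            break
    return history

if __name__ == "__main__":
    for step, row in enumerate(simulate(), start=1):
        print(step, dict(zip(STRATA, row)))
```

In a typical run the printed rows show several strata partway through their work at the same time step, which is the sense of "parallel" intended here, while no lower stratum ever gets ahead of the one above it.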


[Figure 1: relational-network diagram. From top to bottom: CogStore (Semantics/Pragmatics); Language
interface; Semology (Semotactics); Semo-lexemic realization; Syntax (Lexotactics/Syntax); Lexo-morphemic
realization; Morphology (Morphotactics); Morpho-phonemic realization; Phonology (Phonotactics);
Phono-hypophonemic realization; Hypophonology (Hypophonotactics); Oral-Aural Interface.]


Figure 1. Outline of the linguistic system, relative to the cognitive


2. ADDITIONAL FINDINGS. What we found in our further analysis was not sets of new error types but 
rather examples motivating a finer analysis of error types. There were errors that did not admit a unique 
solution and errors that were in fact a combination of errors. There were also examples that were simply 
incoherent. Finally, there were examples that proved to be interesting in unexpected ways. Yet all but the 
incoherent¹ were compatible with the system outlined in Figure 1 and the hypothesis of spreading
activation with random rest periods. Moreover, the groupings were found in both English and Polish 
examples. There was nothing that could be ascribed to a language-specific peculiarity. We turn to these 
examples now, taking them by grouping. 


2.1. AMBIGUOUS LOCUS CLASSIFICATION. By locus of error, we mean the place on Figure 1 where the 
error may have occurred. The first group of examples involves timing errors, which occur at the intersection 
between two strata. Both English and Polish examples are given in Table 1. 


¹ If we found them incoherent, of course we cannot assert their compatibility with a linguistic system.



Table 1. Ambiguous locus classification.

[Rows 1-3 (the English examples) and the intended form for row 4 are not legible in the source.]

No.  Output                                        Intended
4    komplet kościelowy ‘church linens’
5    przedziałek między zębami                     przerwa ‘gap’
     ‘cleft between teeth’
6    gdzie kucharek jeść tam nie ma co sześć       gdzie kucharek sześć, tam nie ma co jeść
     (cannot be parsed)                            ‘where there are 6 cooks, there’s no food’





Most of these examples involve timing errors, which are both common in occurrence and often very
easy to notice. Recall that timing errors appear at the intersection between two strata. Brick in the first
example is certainly a perseveration of the vowel in the three preceding lexemes, but it is unclear whether it
is a morpho-phonemic perseveration of the syllabic peak or a phono-hypophonemic perseveration of the
feature high. In the second example we have an anticipation. But it could be a lexo-morphemic anticipation
of the morpheme cause, a morpho-phonemic anticipation of the syllable onset, or a phono-hypophonemic
anticipation of the place feature. The third example is a spoonerism, but it could be a semo-lexemic
spoonerism of the two lexemes, a lexo-morphemic spoonerism of the two morphemes, or a
morpho-phonemic spoonerism of the two onsets.

The Polish examples begin with a perseveration in example (4). It could be a morpho-phonemic
perseveration of the syllable onset or a phono-hypophonemic perseveration of the place feature. Example
(5) is a substitution, but it could be the entire lexeme or just the stem morpheme (działek ‘divider’ for rwa
‘break, tear’) that is involved. Finally, the sixth example has a spoonerism involving a proverb, the Polish
equivalent of ‘too many cooks spoil the soup.’ We saved this one for last because it has an even more complex
set of error-locus possibilities. That is, the spoonerism could be of the two lexemes, making it
semo-lexemic, but it could also be morpho-phonemic. In the latter case, it could involve the syllable onset or the
entire syllable.


2.2. AMBIGUOUS ERROR TYPE. For the set of examples in Table 2 (following page), the locus of the error 
is clear, but how to identify the error type is not, as more than one classification applies. 

In example (1), “Nothing extraordinary about somebody losing his celebrity life to a TV show” was said
about the suicide of Russell Armstrong, husband of Taylor Armstrong, who was featured on “Real
Housewives of Beverly Hills”. Producing life for wife could have been a morpho-phonemic perseveration
of the onset of a stressed syllable or a phono-hypophonemic perseveration of the place feature. It could
also have been a rather dark substitution of the entire lexeme and might even be termed a Freudian slip.
Either way the output was striking in context. Other English examples of ambiguous error type are
discussed in later sections, as they have additional points of interest.



Table 2. Ambiguous error type.

No.  Output                                        Intended
1    Nothing extraordinary about somebody          wife
     losing his celebrity life to a TV show.
2    zapłaciłam 5 minut                            zapłaciłam za 5 minut ‘for 5 minutes’
     ‘I paid 5 minutes’                            zapłaciłam 5 złotych ‘5 zloty’
3    konserwatorium ‘conservatory’                 konwersatorium ‘symposium’
4    woda salicylowa ‘salicylic water’             woda utleniona ‘H2O2’
                                                   spirytus salicylowy ‘salicylic acid’
5    lepszy gołąb w garści niż kanarek na dachu    lepszy wróbel ‘sparrow’ w garści
     ‘better a pigeon in the hand than a canary    niż gołąb ‘pigeon’ na dachu
     on the roof’





The second example is ambiguous in that there are two possible correct solutions. If minut is correct,
then the preposition was lost. If złotych is correct, no preposition is needed. So was minut substituted
for złotych or was the output the product of a blend?² The third example is three ways ambiguous. It could
be a morpho-phonemic spoonerism of the onsets of two consecutive unstressed syllables, a
phono-hypophonemic spoonerism of the place feature of two adjacent consonants, or a substitution. Even here it
is possible to imagine a lexemic substitution or a morpheme substitution. The fourth example parallels the
second, in that which lexeme was substituted depends on what was intended. Finally, the fifth example
involves the Polish proverb that parallels English “a bird in the hand is worth two in the bush”. People
frequently misspeak one or both of the bird names. In the present instance it could be two substitutions or
a semo-lexemic spoonerism and a single substitution.

The examples in the next set exhibit more than one speech error, but both are of the same type. 


2.3. THE SAME ERROR TYPE MORE THAN ONCE. In this set of examples (Table 3) we see output that 
results from two errors of the same type, generally on two different strata, though more than two are possible. 
These errors are especially interesting, because they provide information about processing. They show that 
the processing on one stratum intersects with but does not depend on having correct input from other strata. 
Each stratum is capable of making its own mistakes, and the effects of an error on an upper stratum are 
carried over onto the lower strata where the same type of mistake can be made independently. 


² In this and subsequent discussion, or should be understood INclusively, not EXclusively.



Table 3. Same error type more than once.

[Rows 1-3 are not legible in the source.]

No.  Output            Intended
4    powypisywawuja    powypisuja ‘you will be writing out some ...’


In the first example we see the effects of two timing errors. A semo-lexemic spoonerism produces proud
quite instead of the intended form. Subsequently, a morpho-phonemic anticipation results in the syllable
peak of quite being produced in pride as well.

The English error is relatively straightforward, compared to the Polish examples, which increase in
complexity as presented. In 2, kurturarny, the first r is an anticipation and the third r is a perseveration.
Both are phono-hypophonemic timing errors. In 3, otwierasz w okni kuchno first shows a semo-lexemic
spoonerism between a noun (okno ‘window’) and its qualifying prepositional phrase (w kuchni ‘in the
kitchen’). Additionally there is a lexo-morphemic spoonerism of root morphemes (ok and kuch). Finally,
in 4, powypisywawuja shows two morphotactic errors with two infinitive suffixes in addition to the
imperfectivizing suffix before the non-past tense ending. That is, the infinitive of 4 would have to be
powypisywawywać or powypisywawować. Here yw-a can be analyzed as imperfectivizing suffix + infinitive
suffix and ow-a as verb-forming suffix + infinitive suffix, either way a doubled error.

There are also examples with more than one error but of different types. We turn to those now. 


2.4. MULTIPLE ERRORS OF DIFFERENT TYPES. The examples with mixed-type errors are presented in 
Table 4 (English) and Table 5 (Polish, following page). 


Table 4. Multiple errors of different types (English).

No.  Output                                                  Intended
1    This has been one mind blogging ...                     mind-blowing
     mind-boggling ... mind-blowing case
2    I need to do the applical ...                           [not legible in the source]
3    She’s a legal consult [kon-su ‘It] ...                  legal consenting adult
4    Oh, by the way, that strong odor of smish               fish smell
5    Where does sex tip [sɛkstɪp] come from?




With the exception of 5, all these examples were self-corrected to the intended form. In 1, blogging is an
unintended blend of boggling and blowing followed by a phonotactic jumble. Possibly it was a substitution
for blowing, corrected to boggling, then amplified by the originally intended blowing. In 2, applical results
from a semo-lexemic spoonerism followed by a morphotactic blend. Number 3 shows a redundancy: if
someone is a consenting adult, she’s legal. There is also a phonotactic blend producing [kon-su ‘It], with the
onset of the second syllable from consenting and the final from adult. In 4 we see a semo-lexemic spoonerism
between fish and smell followed by a phonological blend. There is also a redundancy involving odor and
smell.

Example 5 requires some discussion. Clearly a spoonerism is involved here. The simplest analysis, based
on typing alone, would be that the vowels remained in position and all the consonants were spoonerized,
retaining their phonotactic (actually graphotactic) positions, as usual in timing errors. But remember that
this example was spoken, so an analysis based on the written form isn’t reasonable. Rather we posit a
semo-lexemic spoonerism producing [sɪks stɛp] followed by a morpho-phonemic spoonerism between adjacent
syllable peaks.
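
The second step of this analysis, the exchange of syllable peaks, can be worked through mechanically. The sketch below is illustrative only: the transcriptions [sɪks] and [stɛp] are a reading of the partly garbled forms in the source, the vowel inventory is restricted to the two vowels needed, and the function name swap_peaks is a hypothetical helper, not part of the authors' formalism.

```python
# Only the two vowels needed for this example (an illustrative assumption).
VOWELS = set("ɪɛ")

def swap_peaks(syl_a, syl_b):
    """Exchange the vowels (syllable peaks) of two one-vowel syllables,
    leaving every consonant in its original position."""
    def split(syllable):
        # Index of the first (and only) vowel in the syllable.
        i = next(k for k, ch in enumerate(syllable) if ch in VOWELS)
        return syllable[:i], syllable[i], syllable[i + 1:]

    onset_a, peak_a, coda_a = split(syl_a)
    onset_b, peak_b, coda_b = split(syl_b)
    return onset_a + peak_b + coda_a, onset_b + peak_a + coda_b

# Output of the posited semo-lexemic spoonerism, then the peak exchange.
print(swap_peaks("sɪks", "stɛp"))  # ('sɛks', 'stɪp'), i.e. "sex tip"
```

Because only the peaks move, the onsets and codas stay where the first spoonerism left them, which is what distinguishes this account from the consonant-swapping analysis based on the written form.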

In Table 5, the first example is as incoherent in Polish as the attempted English gloss is. There is a
complete lexotactic jumble and in the process a lexeme (ręki ‘hand’) is lost. Example 2 shows a lexemic
substitution of dostał ‘got’ by miał ‘had’ with a subsequent morpho-phonemic spoonerism of syllable onsets
involving miał and by. But it could also be a phono-hypophonemic spoonerism between manner features.
Example 3, as observed, involves a morphotactic mistake on the verb conjugation and a phonotactic error in
the hard d. Example 4 shows a semo-lexemic spoonerism between two adjacent nouns and an incorrect
imperative form of wyjmować ‘take out’. The nominal morphology is correct.


Table 5. Multiple errors of different types (Polish).

No.  Output                                        Intended
1    weź piwo do puszki                            weź puszkę piwa do ręki
     ‘?take beer into a can’                       ‘pick up a can of beer’
2    twój ojciec my biał szok                      twój ojciec by dostał szoku
     ‘your father would have a shock’              ‘your father would be shocked’
3    dzisiaj idymy ‘today ??’                      dzisiaj idziemy ‘today we’re going’
4    wyjm lodówkę z sałatki                        wyjmij sałatkę z lodówki
     ‘take the refrigerator out of the salad’      ‘take the salad out of the refrigerator’





2.5. SOMEHOW INCOHERENT EXAMPLES. We also found examples of utterances that were incoherent 
in some way. The incoherence made it impossible to tell with any certainty what was actually intended, thus 
the error could not be analyzed and categorized with confidence. Yet they are not without interest in relation 
to language processing. 

For example, “utilities are got to put back the pieces” is a syntactic and possibly morphological jumble, if
the intended output was “utilities have to put the pieces back together”. “It used to be in golf that a player
would never talks backs about Tiger” is even worse. “Seven days to 24 hours fully-clothed on top of the
church” shows a semo-lexemic spoonerism (should be 24 hours to 7 days), but it still makes little sense, even
with the spoonerism corrected. We also have a phonologically incoherent example in “old people are
definitely having a hijer (*harder) time getting hired”, though it might have an unintended blend combined
with a phonotactic error.

There are also Polish examples. Poznałam to z własnej autopsji ‘I learned that from my own autopsy’
makes the speaker the first person to survive being autopsied. Parę odcinki temu ‘?’ is so bad grammatically
that a colleague said she didn’t believe a native speaker could have said it. Even if we clean up the grammar
and get parę odcinków temu ‘a couple of fragments ago’, it’s unclear what might have been intended.

In one case the student submitting an incoherent example said she probably shouldn’t submit it,
because it didn’t make sense and she didn’t really know whether she had gotten it right. We suspect that this
was not the only time such a circumstance arose, but the students just discarded the examples, perhaps
deciding that they simply had not heard it accurately.

Another interesting discovery that we made involves a set of examples offered as errors, whereas they are 
actually dialect alternatives. There were also several examples that aroused our interest for various reasons. 
We discuss these in sections 3 and 4. 


3. DIALECT FORMS AS ERRORS. In English we found rooftops [rʊftaps]. The speaker immediately
corrected himself to [ruftaps]. This was particularly amusing to WJS, for whom forms like rʊf ‘roof’, rʊt
‘root’, hʊf ‘hoof’, and rʊm ‘room’ are not only standard but normal.

In Polish, students turned in two examples of errors involving gapią się ‘they gape’, standard
pronunciation with nasal o [õ]: gapiom with clearly articulated m and gapioł [ow] without nasalization.
Even linguistics majors are very hard to convince that regional dialects are anything but substandard, and so
it was with these two. What makes them humorous is the juxtaposition between them. Gapiom is a
working-class variant. Academics used to make fun of Lech Wałęsa because he would pronounce the nasal consonant
in word-final position on verbs instead of the nasal vowel. Conversely, gapioł is not from a lower-class
dialect. It is frequent along the Czech border, where it might be a borrowing from standard Czech, but it’s
also widespread in Warsaw, the capital, where (according to many Poles) everyone thinks they’re superior to
the rest of Poland. We also had several examples of words like rozumisz instead of rozumiesz. Like gapioł,
they may reflect a border dialect. We note, however, that examples of this sort were nowhere near as
common as other kinds.


4. VARIOUS REVEALING EXAMPLES. These examples, some English and some Polish, display different 
interesting features of decoding and encoding of linguistic messages, in addition to reifying some of the 
details of the logic underlying linguistic information storage relative to Figure 1. 

The first example involves a newscaster on CNN reading the text of an email from a listener. The text, 
visible on the screen, was “I am being treated like a second-hand citizen because” The newscaster reading 
it says “a second-class citizen”. Since the email is from a smoker responding to a proposed smoking ban, 
second-class is the correct modifier. This indicates that decoding a message and re-encoding it are parallel 
and opposite processes but are not mirror-image processes. The newscaster read, decoded, and correctly 
understood the message in spite of the error, then re-encoded it correctly without the error, never noting the 
discrepancy. 

To emphasize how common this phenomenon is, consider this sentence taken from our abstract: “Then
you’d have to deal with the burden of the guilt.” The actual example, produced by presidential historian
Doris Kearns Goodwin, is “the guilt of the burden”. This error, in which we reported the correct version
and not the spoonerism, while intending to report the spoonerized error, was originally typed in by WJS and
went past several proofings by WJS and ST and at least one other person. No one, not even (or perhaps
especially) the most sensitized, is immune to this kind of decoding-encoding error. Worse yet, the example
was originally recorded by WJS.



The next example is a spontaneous oral production. When the speaker decoded what they had said, 
they realized that the attempt was not what was wanted. The next attempt zeroed in on a more satisfactory 
if still inappropriate result: “So it’s like the flea wagging the dog the flea that’s wagging the tail that’s wagging 
the dog.” 

Another English example is a malapropism or possibly a Freudian slip: “If you experience a resurrection 
lasting over three hours, see a doctor.”³ Technically, of course, it is a substitution with a morphologically
related lexeme. 

We also discovered a hypophonotactic blend that joins only one other in this subtype of unintended 
blends. The host on a talk show introduced Amanda Knox, accused of complicity in the murder of a fellow 
exchange student, as Amanda Knot. The blend worked as seen in Table 6 (following page). 


Table 6. Hypophonotactic blend. 





The grapheme x is realized as a ks sequence. The k is related to the features Oral Closure (Cl) and Dorsal
(Do). The s is related to the features Spirant Friction (Sp) and Apical (Ap). Blending the two consonants by
taking the manner feature Cl from k and the place feature Ap from s produces Cl & Ap, which is t.

There are also several Polish examples, which offer additional points of interest. Komini się z dyma and
stoli na leży, neither of which has a reasonable possible meaning, are the product of a lexo-morphemic
spoonerism between stems, one nominal and the other verbal. They should be dymi się z komina ‘it’s
smoking from the chimney’ and leży na stole ‘it’s lying on the table’. This suggests that the speakers have a
stem node that is not specific to either nouns or verbs, a phenomenon seen in English lexemes like comb,
which are not specific to either verbs or nouns.
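
The feature arithmetic behind the Knox/Knot blend can be sketched as simple set operations. This is a minimal illustration, assuming only the three segments and four features named in the text; the dictionary layout and the function name blend are hypothetical conveniences, not the authors' notation.

```python
# Feature classes and segments as named in the text (Cl, Do, Sp, Ap).
MANNER = {"Cl", "Sp"}   # Oral Closure, Spirant Friction
PLACE = {"Do", "Ap"}    # Dorsal, Apical

# Tiny illustrative inventory: only the segments involved in this example.
SEGMENTS = {
    "k": frozenset({"Cl", "Do"}),
    "s": frozenset({"Sp", "Ap"}),
    "t": frozenset({"Cl", "Ap"}),
}

def blend(manner_source, place_source):
    """Combine the manner feature of one segment with the place feature of
    another and return the matching segment, or None if the inventory has none."""
    manner = SEGMENTS[manner_source] & MANNER
    place = SEGMENTS[place_source] & PLACE
    target = frozenset(manner | place)
    for segment, features in SEGMENTS.items():
        if features == target:
            return segment
    return None

print(blend("k", "s"))  # t  (Cl from k, Ap from s), giving "Knot" for "Knox"
```

Taking the features the other way round (Sp from s, Do from k) yields a combination with no match in this three-segment inventory, so the function returns None for that case.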

Another example is w stanie Kanzas siedem osób zabiło tornado. The student said it should be w stanie
Kanzas siedem osób zostało zabitych przez tornado ‘in the state of Kansas seven people were killed by a
tornado’. The student’s version is a correct one, but she was confused by the fact that siedem osób and
tornado are both nominative-accusative forms. She was reading siedem osób as the nominative subject and
tornado as the accusative object, so the sentence would mean ‘in the state of Kansas seven people killed a
tornado’. But siedem osób could be read as accusative object and tornado as the nominative subject, meaning
that ‘in the state of Kansas a tornado killed seven people’. This shows that if there is an ambiguity in case
endings and two nouns could each be nominative or accusative, then the unmarked interpretation assumes
SVO order. Of course, in the proper discourse context, OVS order could be the reasonable choice. Thus,
it’s not an actual mistake, but it tells us something about both encoding and decoding. The connection
between OVS order and discourse tells us that semology exerts controls on syntax. At the same time, the
decoding may well proceed without those controls or with those controls much weaker, as the syntactic
processing begins in advance of the semotactic processing.

³ In this case we advise seeing a priest.

Our last example is one gathered by an advanced English literature student. A native speaker in a
conference setting said to a small group of early risers waiting for the restaurant to open, Śniadamy. The
meaning was clear in context: siadamy do śniadania ‘let’s sit down to breakfast’. Though śniadamy made
immediate sense, WJS had never heard it and thought it might be a neologism. A little dictionary research
provided some interesting facts, however. This verb is not found in either the recent Kosciuszko Foundation
dictionary (Fisiak 2003) or the new PWN dictionary (Sobol 2002). But it is found in the old Kosciuszko
Foundation dictionary (Whitfield 1963). Rather than a neologism, it seems to be an obsolescing lexeme.⁴
This seems strange, as it appears to be a useful word, unremarkable in its morphology and parallel to Russian
zavtrakat’ ‘eat breakfast’ < zavtrak ‘breakfast’. But an explanation is beyond our present scope.

This completes our survey of interesting and representative examples that were discovered in our
previously unexamined corpus of speech errors. We now summarize their implications. 


5. CONCLUSION. This hunt through thousands of previously unexamined examples turned up some 
interesting insights. First, none of the findings of our previous series of studies was negated. Second, 
everything here is compatible with a relational network configured according to Figure 1. Third, we got 
two new pieces of evidence about the use of the linguistic system. That is, an error on an upper stratum 
carries through to the lower strata, which may have erroneous output added to the previous error. This 
supports top-down loosely parallel processing. Moreover, decoding and encoding are related but distinct
and opposite processes overall, not just in their output. We draw this inference from the fact that people
can hear an utterance with an error, decode it correctly, and re-encode it without the error and without 
even realizing that there had been an error. 

Thus, even though there was no new error type discovered, the examination of the unexamined has 
paid off. Even better, it points the way to further research on the decoding process and its relation to 
encoding. 


⁴ WJS recalls hearing no such word from his grandparents, but breakfast was on the table when the milkers returned from the
barn. It was never necessary to call anyone, least of all WJS. 



REFERENCES 


FISIAK, JACEK. (ED.) 2003. The new Kosciuszko foundation Polish-English dictionary. New York: The
Kosciuszko Foundation.

SOBOL, ELŻBIETA. (ED.) 2002. Nowy słownik języka polskiego. Warszawa: Wydawnictwo naukowe PWN.

SULLIVAN, WILLIAM J. 2011. Input, output, and (de)linearization: What we owe to Sydney M. Lamb.
LACUS forum XXXVI: 279-89.

---, & SARAH TSIANG. In press a. “It’s deja vu all over again!”: Are redundancies speech errors? To
appear in LACUS forum XXXIX.

---, & ---. In press b. Tactic pattern errors and the architecture of stratificational theory. To appear in
LACUS forum XXXVII.

---, & ---. In press c. Substitutions as speech errors: “I resemble that remark.” To appear in LACUS forum
XL.

TSIANG, SARAH, & WILLIAM J. SULLIVAN. In press. Unintended blends and the stratification of
language. To appear in LACUS forum XXXVIII.

WHITFIELD, FRANCIS J. (ED.) 1963. The Kosciuszko foundation Polish-English dictionary. New York:
The Kosciuszko Foundation.


This article was first published at lacus.weebly.com. 


LACUS FORUM 


Journal of the Linguistic 
Association of Canada and the 
United States 


VOLUME 41 NUMBER 3, 2020 


CONTENTS 


10. MODELING TRANSITIONS IN THE LANGUAGE OF PSYCHIATRIC DISORDERS 1 
Jonathan Fine 

11. GRICE’S MAXIMS RECAST: ORTHOCONCEPTS, EXPECTATIONS, AND THEORY OF MIND IN HUMAN LINGUISTICS 12 
Douglas W. Coleman 

12. THE SYNTAX-PROSODY INTERFACE: A CASE STUDY OF ENUMERATIONS 23 
Svetlana Kaminskaia 

13. ANCIENT INDIAN GRAMMARS AS DOORWAYS TO FREEDOM 34 
Karen/Sruti Malcolm 

14. SOME LINGUISTIC PERSPECTIVES ON SUPERSTORM SANDY: TWO GOVERNORS AND THEIR COMMUNITIES 43 
Jacqueline Nenchin & Susan Feltman 


LACUS Forum 41:3.1-11 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free 
of charge and with no implied warranty. Its contents are available to you under the terms of the 
Creative Commons Attribution-NonCommercial license version 3.0; see 
http://creativecommons.org/licenses/by-nc/3.0/. 
Copyright © 2020 the Linguistic Association of Canada and the United States. 


MODELING TRANSITIONS IN THE 
LANGUAGE OF PSYCHIATRIC DISORDERS 
JONATHAN FINE 
Bar-Ilan University 


Abstract: The language of speakers with psychiatric disorders contains markers of the specific characteristics 
of the disorders. These linguistic characteristics are not homogeneously distributed in interaction. Rather, 
transitions from the interlocutor to the speaker with a disorder are associated with widely varying amounts of 
the features that index the disorder. Semantic features and interaction features (speech functions) are 
associated with variation in completeness of clauses and hedges in hepatic encephalopathy and speech 
functions are associated with echolalia in autism. Methodologically, the unit of analysis of the transition can 
reveal patterning of the essential characteristics of disorders that are not evident when the unit of analysis is 
a larger unit. For the speaker, the transitions index cognitive processes and for the hearer, the transitions 
indicate the points of salience for difficult interpretation and diagnosis. 


Keywords: clinical linguistics, speech functions, autism, systemic, interactional transitions 


ANALYSING SPONTANEOUS INTERACTIONS of speakers with psychiatric disorders raises the 
question of what linguistic patterns create the impression that the language (1) is not typical for the context, 
and (2) indexes a specific kind of psycho-pathological atypicality. The traditional approach in studying the 
language of psychiatric disorders is to determine the frequency of one or more linguistic variables (e.g., for 
schizophrenia: unclear reference, exophoric reference, lexical cohesion, Rochester & Martin 1979). Usually 
such a variable will be distinctly higher or lower in some affected population compared to a control 
population. However, the problem is that some of the phenomena that attract attention do so because of 
juxtapositions in the flow of interaction. These juxtapositions both track the speaker’s social and cognitive 
states and constitute the data that hearers use to form the impressions of psychopathology. The approach 
presented here is to model the transitions at choice points at various levels of analysis, including exchange 
structures and speech functions. Individual level data show wide variations in how texts move from point 
to point creating specific impressions of atypicality and psychopathology. In hepatic encephalopathy, 
discourse patterning after initiations with temporal information is considerably impaired compared to 
responses to initiations without temporal information. In autistic spectrum disorder (ASD), echolalia is 
more frequent after initiations that demand information in specific ways. 

Methodologically, it may be difficult to generalise from individuals to diagnostic categories, but the 
individual level data show the patterning in language that sounds pathological. The control is achieved by 
comparing patterning in different contexts in a within-subject design. Given the great variability within the 
diagnoses, individual level data is more faithful to what creates the impressions for the hearer than group 
level data which present an averaged picture that may well not fit with clinical reality. 

The analysis of transitions in spontaneous talk describes both the speaker’s role and the hearer’s role. 
From the speaker's perspective, cognitive processing and the speaker’s parsing of the context, including 
meanings and wordings, influence how the next bit of text is formulated. For example, the speaker has 
notions about what is new or given to the hearer, the role relationships of speaker and hearer, the emerging 
register and genre, and the current kind of exchange in the discourse structure. From the hearer’s 
perspective, the flow of text with its transitions is to be interpreted against the hearer’s expectations, built 
up from interacting in the speech community, for particular microcontexts. The hearer builds and readjusts 
impressions of whether the language stretches the limits of interpretability or just fashions unexpected 
structures and meanings. The patterns of meanings that the hearer builds step by step contribute to 
impressions of register (e.g., what is the field, representing the institutional focus) and the impression of the 
appropriateness and even the sanity of the speaker. Various points in the flow of text are seen as choice 
points in the semantics, discourse structure and lexico-grammar that are critical for how hearers make sense 
of the text and form impressions of the speaker. 


1. MODELLING TRANSITIONS. Spontaneous language can be seen as waves of different meanings that are 
both exchanged between speakers and that are developed consecutively by one speaker in more monologic 
parts of a dialogue. The changes in meaning are obviously dependent on what has just been said. These local 
contexts will be called microcontexts. From a more intra-speaker view, the language tracks the speaker’s 
processing, typical or pathological, in context. In normal interaction and in psychopathology, we hear 
sequences of meanings rather than structures. Our goal is to give detailed linguistic analysis of language in 
context to track the fine temporal resolution of the ups and downs of processing of language and the 
environment. The choice points create transitions at various levels of analysis, including schema elements 
in genres such as narratives, exchange structures, speech functions, and lexico-grammatical realisations. 
Individual level data show wide variations in how texts move from point to point, creating specific 
impressions of atypicality and psychopathology. When the point to point atypicality is across speakers, the 
earlier phenomena can be thought of as triggers that are followed by an atypical linguistic development of 
the interaction. When the point to point atypicalities are within the speech of a single speaker, then the 
speaker's model of the interaction and of the text are more at issue. 

The questions then to be addressed below are: (1) What is atypical in the rises and falls in language, 
whether uninterpretable or not? and (2) What are the triggers in the contexts for the atypical rises and falls? 
The modelling at the choice points is from two perspectives. From the speaker's perspective: what is the 
speaker doing moment by moment? From the hearer’s perspective: what does the language sound like moment by 
moment, and what social impression is being made? In example (1), spoken by a speaker with florid 
schizophrenia, we can ask from the speaker’s perspective how the speaker is processing the context and 
building a text that is so atypical. From the hearer’s perspective, we can ask at what point, and why, 
the hearer forms the impression that the talk is atypical, uninterpretable and, specifically, sounds psychotic. 


(1) 

Interviewer: ....did someone bring you [here]? 

Patient: did someone bring me ... no nobody brought me ... I flew in my very own plane ... I 
wouldn't trust anybody down at the airport to bring me anywhere 

I: why not? 

P: why not ... why not ... for the very simple fact that I don’t think that they can really handle a 
plane ... unless they are going to lay me down as a little rose and put me back in the box and 
they can fly me around because there’s a million people out there that can meet the same world 
... it’s just world after world after a world right 

I: yea 

P: and there’s no need to be worried about that and if you want them you can go down and get 
them and if you figure you need plastic ones or other ones if they figure they need those things 
and they want those things it’s entirely up to them (courtesy Drs. G. Bartolucci, J. Pelletier). 


Two kinds of data will be used to address these questions of transitions. The first is the case of hepatic 
encephalopathy, in which there is an effect of temporal meaning in the microcontexts in contrast to 
nontemporal meaning, and also constraining speech functions, in contrast to non-constraining speech 
functions, on the expression of certainty and hedging and sometimes on the completeness of utterances. The 
second set of data presents the effect of constraining, in contrast to non-constraining, microcontexts on the 
frequency of echolalia in children with autistic spectrum disorder. 


2. TEMPORAL MICROCONTEXTS AND CONSTRAINING SPEECH FUNCTION IN HEPATIC 
ENCEPHALOPATHY. Hepatic encephalopathy (HE) is a neurodegenerative disorder brought on by liver 
malfunction. It is characterised by abnormalities of mood and behaviour, disturbances of consciousness, 
personality changes, memory and attention alterations, reduced visuo-practic capacity, and impaired temporal 
organisation and processing. It is this last characteristic of temporal organisation and processing that is 
associated with atypical transitions in spontaneous interaction. The data come from an English native 
speaking 28-year-old female nurse (presented in Lyubman 2007; access to patient courtesy L. Blendis). 
She was informally interviewed twice in the course of the disease: once before a liver transplant and again 
3 years and 3 months later, after the transplant. The topics were family, education, occupation, and how the 
patient feels. The interviewer was a female speech and language therapist about the same age as the patient, 
who was blind to the purposes and methods of the linguistic study on HE. 

Given the clinically identified difficulties in sequencing and temporal organisation, the transition 
points of interest were those immediately after temporal compared to nontemporal information. The 
interview was largely composed of exchanges in which the interviewer initiated with either statements or 
questions and the patient responded. The last clauses of interviewer turns with initiations were therefore 
classified as either containing temporal information (see example 2) or not. Nontemporal clauses could be 
causal, additive or adversative, following the possibilities of clause combining of Halliday & Matthiessen 
(2004). 


(2) 
I: Were they ever concerned about you being able to (pause) 
P: When I... First I went into nursing school I... It was part of my ah ... (pause) 
... Other problems have arisen. 


To track the association of temporal and nontemporal contexts with simple structural issues, the first clause 
spoken by the patient after these contexts was noted as either a complete or an incomplete clause. 
Incomplete clauses were those with a missing clause element. Clauses with appropriate ellipsis (Halliday & 
Hasan 1976) were not regarded as incomplete. Figure 1 presents incomplete clauses as a percentage of all 
clauses spoken by the patient immediately after interviewer turns. 


[Figure 1: bar chart. Vertical axis: percent incomplete clauses (0% to 60%). Horizontal axis: nontemporal context, temporal context. Series: before transplant, after transplant.] 

Figure 1. Incomplete clauses spoken by the patient immediately after the interviewer as a percentage of all 
clauses in that position. 


Before the transplant, the patient with HE did not complete over 50% of the clauses after the interviewer's 
temporal clauses. In contrast, only about 10% of the clauses were incomplete after nontemporal contexts. 
The transition after temporal information is much more disordered than the transition after nontemporal 
information. After the transplant, the large number of incomplete clauses after temporal contexts has largely 
disappeared. 
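
The tabulation behind percentages of this kind can be sketched in a few lines of Python. This is only an illustrative sketch: the coded transitions below are invented for the example and are not the patient's data, and the names `transitions` and `percent_incomplete` are hypothetical. Each record pairs the context of the interviewer's last clause with a hand-coded judgement of whether the patient's first clause was complete.

```python
from collections import Counter

# Hypothetical coded transitions: (context of the interviewer's last clause,
# whether the patient's first clause was complete). Invented for illustration.
transitions = [
    ("temporal", False),
    ("temporal", True),
    ("temporal", False),
    ("nontemporal", True),
    ("nontemporal", True),
    ("nontemporal", False),
    ("nontemporal", True),
]


def percent_incomplete(coded):
    """Return {context: percentage of first clauses coded as incomplete}."""
    totals = Counter()
    incomplete = Counter()
    for context, is_complete in coded:
        totals[context] += 1
        if not is_complete:
            incomplete[context] += 1
    return {ctx: 100.0 * incomplete[ctx] / totals[ctx] for ctx in totals}


rates = percent_incomplete(transitions)
for context, pct in sorted(rates.items()):
    print(f"{context}: {pct:.1f}% incomplete")
```

Running the same function separately on the interview before the transplant and the interview after it would give the two series of a chart like Figure 1. The coding of completeness itself (including the exclusion of appropriate ellipsis) remains a manual analytic step.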

A more semantic measure of the effect of transitions is the hedging or modality expressed by the 
speaker. Hedging can be expressed in the four ways shown in example (3). 


(3) Expressions of hedging 
Modal operator: I could never remember any of them. 
Modal adjunct: Oh apparently my parents had known that I had ah liver disease a year before. 


Discourse marker: Well I couldn’t keep up at work. 
Metaphor of modality: I think I have a tape recorder. 


The number of expressions of hedging per move (the speech function unit appropriate for hedging) is 
presented in Figure 2 below. 


[Figure 2: bar chart. Vertical axis: hedging markers per move in the answers to temporal/nontemporal questions (0.00 to 0.30). Horizontal axis: nontemporal, temporal. Series: before transplant, after transplant.] 

Figure 2. Expressions of hedging per move after temporal and nontemporal contexts. 


In temporal contexts, hedging is three times more frequent than in nontemporal contexts before the 
transplant and five times more frequent than in nontemporal contexts after the transplant. As well as being 
more closely associated with incomplete clauses, temporal contexts are also associated with clauses 
expressing hedging. The patient is clearly differentially responding both in structure and in the semantics of 
certainty based on the immediately preceding clause. The patient may have a certain level of hedging in her 
talk, but it is crucial to see that the hedging is not evenly distributed through her interactions. There are 
large ups and downs of hedging and these are associated with the microcontexts of what the interlocutor is 
saying. As well as the level of the temporal or nontemporal semantics of the preceding clause, transition 
points can be characterised in terms of the microcontext of the prior speech function, that is, an 
interactional unit. Again, we are concerned with the immediate transition point, here, the last move of the 
prior turn. Moves may clearly constrain following moves of the next speaker by demanding information 
(examples 4i, 4ii) or may leave the following move rather unconstrained in giving information (examples 
4iii, 4iv). The constraining moves clearly indicate what information is demanded in the following move: 
polarity information after yes/no questions, the information specified by the particular wh-word after 
wh-questions. In contrast, after the non-constraining moves, the next speaker has more possibilities to 
choose from. After the interlocutor’s speech function of give: information, the next speaker may respond in 
various ways. Similarly, after the speaker's own initiation in an exchange, that speaker may continue in very 
many ways. Theoretically, there are also speech functions of demand: goods and services and give: goods and 
services, but they are not found in these clinical interviews. 


(4) i. I: Where are you from? (wh-question by the interviewer) 
P: I grew up in Wh. 
ii. I: Now ok now do you have any children? (yes/no question by the interviewer) 
P: No. 
iii. I: And ah you went to high school there and (declarative clause) 
P: I went through ah all you know grew up part of this ... my life in Ottawa and 
iv. I: So when do you think that’s improved since? (wh-question by the interviewer) 
P: Pretty quickly after the liver transplant, within a few months actually. 
I: mhm [and] 
P: [I didn’t] have to look at papers all the time. (patient’s initiation) 


As shown in Figure 3, again, there is clearly a greater association of hedging with preceding 
unconstraining speech functions than with constraining speech functions. These unconstraining speech 
functions do not clearly specify what information the speaker is to supply next in the context. It is the 
specific interactional microcontext of demanding: information that is associated with less hedging, whereas 
other speech functions with other functional configurations are associated with more hedging. 
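
As a rough illustration of how a hedges-per-move rate can be derived from hand-coded moves, the following Python sketch groups moves by whether the prior move was constraining. The mapping of move types to the two categories follows the description above (questions demand information; declaratives and the speaker's own initiations do not). The move records, the label names, and the function `hedges_per_move` are hypothetical and invented for the example; they are not the study's data or procedure.

```python
from collections import defaultdict

# Constraint category of the prior move, following the description in the text.
# The label strings are hypothetical coding labels.
CONSTRAINT = {
    "wh_question": "constraining",
    "yes_no_question": "constraining",
    "declarative": "unconstraining",
    "own_initiation": "unconstraining",
}

# Hypothetical hand-coded moves: (type of prior move, hedges in the patient's move).
moves = [
    ("wh_question", 0),
    ("yes_no_question", 0),
    ("wh_question", 1),
    ("declarative", 1),
    ("declarative", 2),
    ("own_initiation", 0),
    ("yes_no_question", 0),
    ("own_initiation", 1),
]


def hedges_per_move(coded_moves):
    """Return {constraint category: mean number of hedges per move}."""
    hedge_sum = defaultdict(int)
    move_count = defaultdict(int)
    for prior_type, n_hedges in coded_moves:
        category = CONSTRAINT[prior_type]
        hedge_sum[category] += n_hedges
        move_count[category] += 1
    return {cat: hedge_sum[cat] / move_count[cat] for cat in move_count}


hedge_rates = hedges_per_move(moves)
print(hedge_rates)
```

The same grouping could be keyed on temporal versus nontemporal prior clauses instead of speech function. Identifying the hedges themselves (modal operators, modal adjuncts, discourse markers, metaphors of modality) is assumed here to have been done by the analyst.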


[Figure 3: bar chart. Vertical axis: hedges per conversational move (scale up to 1.2). Horizontal axis: unconstraining, constraining. Series: before transplant, after transplant.] 

Figure 3. Expressions of hedging per move after conversationally constraining and not constraining contexts. 


A variety of phenomena at transition points was found in the spontaneous interaction of a speaker with 
a biologically induced cognitive dysfunction. Firstly, a semantic characteristic of the preceding talk, whether 
it is temporal or not, is associated with both variations in meaning (hedging) and variations in structural 
completion. It is possible, though, that this latter failure to complete clauses is ultimately related to some 
processing substrate that is in common with semantic processing. If the speaker is having difficulty planning 
the meaning, there may be more incomplete clauses. Secondly, the essential characteristic at the transition 
point that is associated with effects after the transition point can be in terms of interactional meaning. 
Importantly, then there may be quite a variety of levels and factors of the microcontext that have to be taken 
into account as interaction develops. The next set of analyses shows more of these factors of the 
microcontext. 


3. THE RISES AND FALLS OF ECHOLALIA IN AUTISM. Autism is characterised by three main diagnostic 
criteria: impaired social interaction, impaired communication, and stereotyped, repetitive, idiosyncratic 
language and behaviour. This last characteristic directly involving language is especially associated with 
echolalia (the repetition of words, phrases, intonation, or sounds). However, even with those children who 
use echolalia frequently, there are stretches of language with more echolalia and stretches with less. Both in 
terms of basic research into the mechanisms of autism and in terms of designing approaches to reduce the 
echolalia, it is of great value to identify the triggers of the ups and downs of echolalia. 

The data come from interactions of 8 male speakers of Hebrew diagnosed with autism (American 
Psychiatric Association 1994) aged 8–13 years (mean 11.25 years). Each speaker had echolalia in at least 
35% of his utterances (Zilbernagel 2006). The speakers formed 4 pairs matched for mean length 
of utterance in morphemes; in each pair, one speaker was high communicative and the other low communicative. 
They were interviewed by a familiar teacher who had worked with the children for at least one school year. 
When there is sufficient text frequency (as for the entire category of statements compared to questions), the 
data are presented as the mean of matched pairs to show the consistency of the patterns. When the linguistic 
analysis involves many subcategories and thus low text frequency for each speaker, the data are pooled for 
all speakers. 

As with the previous data set, the amount of constraint in the microcontext is important for accounting 
for the variation in the interaction. Figure 4 presents the percentage of responses that contain echolalia for 
each pair of speakers as a function of the interlocutor's preceding utterance. Clearly but to varying degrees, 
prior statements are associated with more echolalia than prior questions. Speakers with autism have specific 
difficulties in relating to and interacting with others. Such speakers, then, are particularly vulnerable in 
microcontexts in which the demands of the interaction are less clear. The greater amount of echolalia thus 
tracks this impairment in parsing the context and in formulating an appropriate and fluent response. The 
results clearly show that echolalia is far from evenly distributed in the interactions of these children. Rather 
there are concentrations of echolalia around certain kinds of microcontexts defined by the information 
demands of the prior talk. 
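
The two ways of summarising the data described above, means of matched pairs when text frequency is sufficient and pooling across all speakers when it is not, can be contrasted in a short Python sketch. All counts, pair labels, and function names below are hypothetical and are given only to make the arithmetic concrete; they do not reproduce the study's figures.

```python
from statistics import mean

# Hypothetical counts per speaker: (responses with echolalia, total responses)
# after prior questions and after prior statements.
counts = {
    "pair 1": {
        "high": {"question": (2, 10), "statement": (6, 10)},
        "low": {"question": (4, 10), "statement": (8, 10)},
    },
    "pair 2": {
        "high": {"question": (1, 5), "statement": (3, 5)},
        "low": {"question": (3, 10), "statement": (5, 10)},
    },
}


def pct(echo, total):
    """Percentage of responses that contain echolalia."""
    return 100.0 * echo / total


def pair_means(data, prior):
    """Mean percentage over the two members of each matched pair."""
    return {
        pair: mean(pct(*speakers[s][prior]) for s in speakers)
        for pair, speakers in data.items()
    }


def pooled(data, prior):
    """Percentage after pooling the raw counts of all speakers."""
    echo = sum(sp[prior][0] for speakers in data.values() for sp in speakers.values())
    total = sum(sp[prior][1] for speakers in data.values() for sp in speakers.values())
    return pct(echo, total)


print(pair_means(counts, "question"), pair_means(counts, "statement"))
print(pooled(counts, "question"), pooled(counts, "statement"))
```

Pair means weight each speaker equally, whereas pooling weights each response equally, so the two summaries can differ when speakers contribute unequal numbers of responses.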

Within the category of statements, there is also considerable variation of amount of echolalia in the 
responses. This variation is attributable to the kinds of statements. Some statements seem to have a function 
that children with autism find easier to interpret. Several kinds of statements are illustrated in (5) (adapted 
from Zilbernagel 2006). 


[Figure 4: bar chart. Vertical axis: percentage of responses that contain echolalia (0 to 100). Horizontal axis: matched pairs of speakers (pair 1 to pair 4). Series: questions, statements.] 

Figure 4. Percentage of responses that contain echolalia for each pair of speakers as a function of the 
interlocutor's preceding utterance. 


(5) a. New information 

I: Do you know how to swim by yourself? 

C: floats you by yourself, yes. 

I: yes good also in the sea you can swim. 
b. Correction statements 

I: what are these? 

C: shapes. 

I: these are not shapes these are hats. 
c. Answer to a question 

I: do you know how old the State of Israel is? 54 years. 
d. Tone 2 (rising) statements 

I: No the boy is not going to school he is going to the store? 
e. List statements 

I: Let’s look at a book or at pictures. 
f. Rejection statements 

C: what does the bus do? 

I: we are not talking about buses. 
g. Repetition statements 

I: look is this a big or little ball. 

C: little. 

I: little. 
h. Strengthening statements 

I: pitchfork 

C: pitchfork 

I: correct correct 
i. Connecting statements 

I: let’s look at pictures now 




The greatly varying rate of utterances with echolalia as a function of the class of the prior statement is 
presented in Figure 5. Three kinds of statements are associated with considerably higher rates of echolalia: 
new information, an answer to a question, and a correction in the prior utterance. Although the argument 
is post facto, it would appear that these statements provide less indication of what the next turn or speech 
function would be. If the interviewer provides new information, then the speaker with autism must decide 
how to react to that information, on-line and rather quickly. Unlike a question, the statement of new 
information does not signal which specific information should be provided or which interpersonal response 
is appropriate. If the other speaker has in fact answered a question, then the speaker with autism has lost the 
opportunity to supply the specific function of give: information that could have been expected and that was 
perhaps being planned. Rather, the speaker has to reformulate both the function of the contribution and 
how it is to be worded. Correcting statements are similar to statements that answer a question in that what 
had been the likely basis for the child’s response has been changed. The interlocutor has changed some 
information that could well have been the starting point for the child’s response or development in the 
interaction. Again, the child is left to reformulate the next contribution. 


[Figure 5: bar chart. Vertical axis: percentage of utterances with echolalia. Horizontal axis: mean across pairs. Bars by type of prior statement: connecting, rejection, tone 2, strengthening, repetition, new information, answer to a question, correcting.] 

Figure 5. Percentage of utterances with echolalia after different statement types. 


It is clear that some types of statements are followed by much more echolalia than others. From the speaker’s 
perspective, the variation in echolalia may be tracking the ups and downs of being able to parse the semantic 
and interactional environment, tasks known to be impaired in autism. Although the arguments for why 
some statements seem to trigger high rates of echolalia may not be established, the differences are strong 
and suggest underlying processing mechanisms that are sensitive to the microcontexts. In these data from 
speakers with autism, a sign of the autism itself, echolalia, is the linguistic feature that is associated with 
changes in microcontexts. More generally, functional characteristics of the microcontext are associated with, 
and may trigger, aspects of linguistic behaviour that are characteristic of the developmental disorder. 
Although only association and not cause has been established, the findings do suggest reasons for the large 
variation in echolalia and suggest remedial strategies. It follows from the findings that it is worth varying 
contexts and microcontexts to fashion those contexts that can reduce the echolalia that is both signalling 
problems in communication and inhibiting easier communication. 

From the hearer’s perspective, the varying amounts of echolalia create a wavering impression of 
stereotyped, repetitive, and idiosyncratic language. Some of the diagnostic criteria of impaired social 
interaction may also be evident in the high levels of echolalia found regularly in certain microcontexts. 
Although the hearer may subjectively sense the varying amount of echolalia, it is only by the systematic 
analysis of spontaneous interaction that clear patterning emerges. Explicit knowledge of the patterning may 
be able to help hearers predict when there may be disruptive echolalia and even how to help speakers with 
autism reduce the amount of echolalia. 


4. DISCUSSION. The study of transitions in the language of speakers with psychiatric syndromes sets up 
a number of questions that will be discussed in sequence: 1) What are the characteristics of the 
microcontexts that are associated with the changing nature of the patients’ language? 2) What are the 
linguistic characteristics of patients’ language that vary? 3) How are the characteristics of the microcontexts 
to be uncovered? 4) What do the changes in the patients’ language tell us about how the speaker formulates 
language and how the hearer interprets the language? 5) Given considerable individual variation within 
diagnostic categories, how should research move forward? 


4.1. CHARACTERISTICS OF MICROCONTEXTS. The data have shown what must be only a small sample 
of the characteristics of microcontexts that vary with patients’ speech. A semantic characteristic of 
temporality that is expressed in many ways was related to features in the language of a speaker with hepatic 
encephalopathy. The temporal meaning could be expressed in a temporal wh-word (when), adjunct 
(yesterday, at that time), and so on. At quite another level of analysis, the amount of constraint in an 
exchange structure, essentially encoded by different speech functions, may also be the important 
characteristic of the microcontext. In both of these analyses and undoubtedly others, the wording in terms 
of the lexico-grammar (the realisation) may itself be a factor. For example, a demand for information 
realised as a declarative with rising intonation may be associated with a different kind of response 
compared to a simple yes/no question realised as a finite subject structure. Work on various realisations of 
given functions needs much exploration in itself. 


4.2. CHANGING CHARACTERISTICS OF PATIENTS’ LANGUAGE. As with the characteristics of the 
microcontexts, only a few characteristics of the patients’ language have been illustrated. Features of 
incomplete clauses and hedging were found for hepatic encephalopathy and features of echolalia in autistic 
spectrum disorders. There are many psychiatric disorders that have important reflexes in language (Fine 
2006) and so there is a large set of possible linguistic features that may show variation based on the 
microcontexts. Some examples are the pressure of speech in mania and schizophrenia, alogia in 
schizophrenia, monotone intonation in depression, and stereotyped and repetitive behaviour and language 
in the autistic spectrum disorders. Studies that consider whole conversations may show distinct differences 
between the language of an affected population compared to an unaffected population. However, the 
varying ups and downs of language characteristics in the speakers of interest are not accessible when the 
unit of analysis is the overall conversation. Rather the unit of analysis has to be the transition in a 
microcontext. Some problematic characteristics of patients’ language may not emerge when the unit of 
analysis is the conversation. Such obscured characteristics may nonetheless be important in understanding 
the language of the disorder. This point is taken up in the following subsection. 


4.3. UNCOVERING CHARACTERISTICS OF THE MICROCONTEXTS. A few characteristics of the 
microcontexts have been shown. Certainly, given a broad range of disorders, there are many possible 
linguistic features prior to a transition point that can be associated with many possible features of patients’ 
speech after a transition point. There are two primary clues of where to search for relevant linguistic 
features. The first is the clinical description of the disorder. If for autism, there is a clinical description of 
one-sided social communication, then exchange structure is a plausible locus of interest. If in attention deficit 
hyperactivity disorder (ADHD) there is a clinical description of “blurts out answers before questions have 
been completed,” then the investigation of the beginnings of turns is indicated. These clues focus on an 
interspeaker perspective: how do the speakers function in interaction with others? A second clue of where to 
search is by developing hypotheses from the cognitive or neurobiological understanding of the disorder. 
This intraspeaker starting point requires some such deeper understanding of the disorders. These 
understandings are often not available or not well established conceptually or empirically. When they are 
available with some level of confidence (perhaps executive functioning in ADHD), then the linguistic 
investigation can focus on specific structures or processing (perhaps story grammar components or 
cohesion reflecting the executive functions of planning and organising for ADHD). 


4.4. THE SPEAKER’S ROLE AND THE HEARER’S ROLE. As discussed earlier, focusing on the speaker 
raises questions of how the speaker is constructing the interaction step by step. The perspective may be a 
more cognitive one in considering memory issues, planning issues and assembly issues at different levels of 
analysis. On the other hand, the perspective may be more social in considering the development of the social 
process, perhaps in terms of generic structure elements or exchange structures and their constituents. In 
terms of the hearer, there are many problems that need extensive work. We do not understand when in the 
flow of an interaction a hearer forms an impression of a certain kind of disorder. Since there is a 
constellation of linguistic features at different transition points, we have to study how speakers implicitly 
integrate clues from different linguistic features to form an impression of a disorder. For example, the 
interrupting nature of attention deficit hyperactivity disorder may be formed by some combination of 
interrupting in certain grammatical structures, with a certain amount of insistence, at given points in turn 
structure or exchange structure. Furthermore, empirical study of hearers is needed to explore the points in 
a developing interaction that contribute to forming impressions of atypicality. Example (1) from a speaker 
with florid schizophrenia is clearly uninterpretable by the end. However, at what point does the hearer first
notice something atypical and at what point is the atypicality identifiable as the clinical phenomenon of 
thought disorder? This dynamic unfolding of text in psychiatric disorders demands careful study of 
sequences of transition points and sequences of stretches of language that follow them. Then these sequences 
must be tested empirically in context for the effect they have on hearers. The effects are certainly cumulative 
in ways that have not yet been explored. 


4.5. INDIVIDUAL VARIATION. Within each psychiatric disorder, there is considerable variation from 
individual to individual. In many disorders, including schizophrenia and ADHD, the diagnostic criteria are 
structured such that two individuals can meet the criteria for a disorder but have no overlapping diagnostic 
criteria. Those two individuals may then have rather different language characteristics at transition points. 
In other cases, an acute phase of a disorder may be different from a milder and perhaps more common phase. 
The diagnostic criteria and the definition of disorders do, though, produce reliable diagnoses. Such 
reliability and relative stability suggest that there are within-group similarities that the linguistic
investigations can tap. Given the considerable individual variation, though, case studies and series of
case studies can throw light on the phenomena at transitions that group studies with rather heterogeneous 
populations cannot. 

There is much to be learned about the nature of the language of psychiatric disorders by looking 
carefully at the transition points in spontaneous conversations. Insofar as the sampling of interaction
reasonably reflects ecologically valid contexts, an understanding of what is atypical in the talk is achievable. 
There are difficult methodological obstacles in developing control microcontexts for the transitions of 
interest, but these seem not intractable. 




REFERENCES 


AMERICAN PSYCHIATRIC ASSOCIATION. 1994. Diagnostic and statistical manual of mental disorders. 
Fourth edition. Washington DC: APA. 

FINE, JONATHAN. 2006. Language in psychiatry: A handbook for clinical practice. London: Equinox. 

HALLIDAY, MICHAEL A. K., & RUQAIYA HASAN. 1976. Cohesion in English. London: Longman.

HALLIDAY, MICHAEL A. K., & CHRISTIAN M. I. M. MATTHIESSEN. 2004. An introduction to
functional grammar. Third edition. London: Edward Arnold.

LYUBMAN, A. 2007. Language impairment in hepatic encephalopathy. Unpublished PhD dissertation,
Bar-Ilan University.

ROCHESTER, S., & J. R. MARTIN. 1979. Crazy talk: A study of the discourse of schizophrenic speakers. New
York: Plenum.

ZILBERNAGEL, S. 2006. The use of echolalia in context by autistic children. Unpublished PhD dissertation,
Bar-Ilan University.


This article was first published at lacus.weebly.com. 


LACUS Forum 41:3.12-22 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 





GRICE’S MAXIMS RECAST: ORTHOCONCEPTS, EXPECTATIONS, AND THEORY 
OF MIND IN HUMAN LINGUISTICS 
DOUGLAS W. COLEMAN 
University of Toledo 


Abstract: It is shown that Grice’s Cooperative Principle (1989) means that (a) speakers adhere to the Maxims 
as rule-like conditions and (b) hearers assume that speakers generally adhere to the Maxims. This is why Grice 
must characterize a great deal of what constitutes ordinary, everyday speech as “violations” of the Maxims. 
This paper presents an alternative view, one in which speakers and hearers each have their own 
understandings and their own agendas — one which may or may not, in fact, be cooperative. By taking the 
Human Linguistics (a.k.a., Hard-Science Linguistics) approach of Yngve (1996), this paper shows that the 
communicative behaviors that Grice’s Maxims are intended to account for can be explained without 
assuming the Cooperative Principle and without having to characterize much human behavior as involving 
“violations” of Grice’s Maxims, which he adopted by assumption. 


Keywords: Cooperative Principle, Grice’s Maxims, Human Linguistics, Hard-Science Linguistics, 
orthoconcepts, expectations, theory of mind 


Languages: English 


ARE GRICE’S MAXIMS ACTUALLY “MAXIMS”? Grice’s (1989:26-27) Cooperative Principle is 
encapsulated in the maxims of Quantity, Quality, Manner, and Relation (often called Relevance). The Maxim of Quantity states
that the speaker should give as much information as is required, but no more. The Maxim of Quality states 
that the speaker should neither state intentional falsehoods nor speculations as if factual. The Maxim of 
Manner focuses on avoiding various problems with clarity (obscurity, ambiguity, wordiness, disorder). The 
Maxim of Relation states that the speaker should maintain relevance in what is said. Grice justifies the 
maxims through the Cooperative Principle, summed up in this assumption about human interaction 
(1989:26): 


Our talk exchanges ... are characteristically, to some degree at least, cooperative efforts; and each 
participant recognizes in them, to some extent, a common purpose or set of purposes, or at least a
mutually accepted direction. [emphasis added] 


Grice emphasizes the general truth of his claims and offers this to anyone who might question their general 
application: 


A dull but ... adequate answer is that it is just a well-recognized empirical fact that people do behave 
in these ways; they learned to do so in childhood ... and indeed, it would involve a great deal of effort 
to make a radical departure from the habit. It is much easier, for example, to tell the truth than to 
invent lies. (Grice 1989:29) 




Despite his broad claim, it is clear that Grice himself was aware of the descriptive limits to his Principle 
and Maxims. In his statement quoted earlier, note that he was careful to hedge with “characteristically,” “to
some degree at least,” “to some extent,” and “or at least,” which taken together constitute a tremendous
amount of hedging in a single statement of generalization. The choice of “principle” and “maxim” as labels,
therefore, is questionable.¹ A principle is a general statement with broad application, not one rife with
counterexamples. A maxim is generally understood to be a general truth, in particular a general truth about 
behavior. Human behavior is not so straightforward as the maxims imply, nor is it clear that human 
interaction is by nature primarily cooperative, rather than primarily self-serving.²

People frequently get along in conversation without having common understandings and while 
behaving uncooperatively (at least in the sense Grice seems to mean). People cover ignorance by presenting 
inadequate information as if adequate (violating the maxim of quantity). To seem smarter, they provide 
unneeded detail (violating the same maxim in the opposite direction). They lie overtly (violating the maxim 
of quality). They try to impress by using obscure terms (violating the maxim of manner). They deceive by 
speaking obliquely, expecting the hearer to draw a false inference as a result (violating the maxim of 
relevance). Grice understands the Cooperative Principle in terms of common purpose, yet events involving 
individual, rather than cooperative, agendas are so pervasive that Grice’s maxims appear morally normative 
or simply false. For several reasons related to these, cooperative behavior has been harder to explain than
selfishness in purely biological terms. Levinson (2000) suggests the maxims be considered instead as
“heuristics” (27-35) to guide the speaker. However, the relabeling offered by Levinson does not resolve the
issue of failures to fulfill the maxims as an explanatory problem; it merely admits that there is a problem
with Grice’s maxims as maxims per se.


1. WHY DOES THE ISSUE REMAIN UNRESOLVED? Grice’s approach arises out of a tradition of rule- 
based explanations that treat counterevidence as exceptions to rules rather than as challenges to the validity 
of the generalization. It manages this by assuming that the reasons a speaker might adhere to one of the 
Maxims vs. fail to fulfill it are outside the scope of interest. In this paper, I accept a broader understanding 
of what is linguistic vs. nonlinguistic; in the Human Linguistics (a.k.a., Hard-Science Linguistics) 
framework, linguistic properties are the properties of any system — at the individual or group level — that 
must be modeled in order to account for human communicative behavior (Yngve 1996:121-133). Human 
Linguistics avoids limiting the scope of what is thought of as linguistic to abstract, logical-domain entities 
of language. In this way, it moves the focus of linguistics to people communicating in the real world. 

The traditional view takes language as primary and people communicating as an afterthought, in the
process creating domain confusion (Yngve 1996:24): it reifies abstract entities of language so that these 
entities can thus be seen as transmitted (once manifested in physical form) from speaker to speaker. Since 
traditional linguistics takes language as primary and people communicating as secondary, it also imposes a 


¹ The choice of labels is especially interesting when we look at the amount of Grice’s (1989) hedging and his discussion of types of
failure to fulfill maxims, on the one hand, and compare it with the sweeping nature of his claims, on the other hand. There will be 
more on the ways Grice says his maxims can fail to be fulfilled, below. 

² It has generally been taken for granted that the evolutionary history of a species exerts powerful selective pressure on biological
factors that shape an organism’s behavior. Individuals whose behavior leads to the spread of their genes are, in an evolutionary sense, 
more successful than those whose behavior does not. Until the second half of the twentieth century, this made it harder for 
evolutionary biologists and ethologists to explain altruistic behavior than to explain self-serving behavior; they had to depend on 
explanations such as the reciprocal altruism of Richard Trivers (e.g., Trivers 1971) and others. Richard Dawkins (1989, originally 
published in 1976) presents a readable explanation of how “the selfish gene” (that is, all genes as selfish entities) can actually lead to 
altruistic behavior in an organism. In short, in the sciences in which observation of real-world entities counts more than Grice’s 
“well-recognized empirical fact” (read “assumption”), cooperative behavior has been harder to explain than selfishness in purely 
biological terms. 




kind of prescriptivism on the attempt to describe what the people are doing in conversation, albeit one that 
takes the form of observational bias driven by philosophical idealism rather than the good-bad behavior 
kind of prescriptivism of grammarians. From this starting point, it is easy to see why Grice (and many others 
since) have been able to ignore the overwhelming bulk of counterexamples that challenge the Cooperative 
Principle, and thus, Grice’s Maxims as maxims per se. 


2. HOW CAN WE ESCAPE FROM DOMAIN CONFUSION AND DESCRIBE LINGUISTIC 
BEHAVIOR WITHOUT THE PROBLEM OF UNFULFILLED MAXIMS? Let’s consider the ways in which 
Grice himself noted that his maxims can be left unfulfilled by a speaker. Grice actually identifies four 
different ways in which a speaker might fail to fulfill his maxims: violating a maxim, opting out of the 
Cooperative Principle, avoiding a clash of maxims, and flouting a maxim. Grice distinguishes between 
violating and flouting a maxim on the basis of the former being covert and thus misleading the hearer, and 
the latter being “blatant” and thus “not... trying to mislead” (1989:30). 

All four supposed types of lack of fulfillment of maxims will be discussed within examples below. Later 
I will present a unified view that takes into account not only cases in which the Cooperative Principle is 
purportedly the driving force but also cases in which the speaker is clearly not cooperating with the hearer. 
I argue that we should start with as few assumptions as possible and instead focus on what happens when 
people do different kinds of things while communicating. I will show that the speaker's adherence or failure 
to fulfill Grice’s Maxims is very simply motivated by (a) the speaker’s own understanding of relevant 
properties of the hearer, and by (b) what changes in those properties the speaker expects to accomplish. To 
put it simply, sometimes the speaker’s expectation is cooperative; sometimes it is not. 


2.1. A PURPORTED NON-VIOLATION OF THE MAXIMS. Grice uses example (1) as one in which
“no maxim is violated, or at least in which it is not clear that any maxim is violated” (Grice 1989:32).


(1) A: I am out of petrol.
B: There is a garage round the corner.³


Grice does note conditions (that B thinks the gas station is or may be open, that it has gas to sell, etc.). 
However, he does not go into these (note what was said above in regard to common assumptions about what 
is linguistic vs. non-linguistic). Grice still assumes that B’s motivation is based on the Cooperative Principle. 
The problem is this: we have no way of knowing from Grice’s example whether B’s motivation is cooperation;
Grice merely assumes that it is.⁴

Maybe B is a passerby who feels sympathy for A’s plight and wants to help. B may believe that there is a 
gas station around the corner, that it is open, etc. In this case, Grice’s claim fits: B is adhering to the 
Cooperative Principle and no Maxim is being left unfulfilled. 

Perhaps B is a good Samaritan who happens to have a can of gas in his trunk but for some reason, wants 


³ Translations are offered for any reader who is too American to understand “petrol” and those too young to understand “garage”
in this context. “Petrol” is, of course, British English for “gasoline.” Garages (yes, even in America) used to be where people bought
gasoline; garages performed automotive repairs and maintenance as well as selling gasoline.

⁴ Some readers of Grice infer that his Maxims concern hearer interpretations, not speaker intentions. I believe this is an unconscious
attempt to “repair” Grice’s Maxims. In fact, Grice (1989) says that in the example given in (1) “B would be infringing the maxim ‘be
relevant’ unless he thinks, or thinks possible, that the garage is open, and has petrol to sell” (32). He explains that “it is just a well-
recognized empirical fact that people do [adhere to the maxims] ... and indeed, it would involve a great deal of effort to make a
radical departure from the habit” (Grice 1989:29). In short, Grice’s Maxims are said to be violated when speakers depart from them;
hence, they are seen as relating primarily to speaker behavior.




A to experience a selfless act. He sends A around the corner to a nonexistent gas station long enough to
leave the can of gas for A and disappear. It is hard to know whether this is or is not even a violation of the 
Cooperative Principle (verbally yes, nonverbally no?). A ends up with gasoline, but has been sent around the 
corner momentarily on a wild goose chase. 

On the other hand, maybe B is a thief who notices that A’s wheel covers are extremely expensive and 
can readily be resold. B may simply want to get A out of the way for a few minutes so that he can make off 
with the wheel covers with no fuss. B may be saying something he believes is true (if he thinks the corner is 
far enough away that he will have the time he needs before A sees there is no gas station). In this case, even 
though B is fulfilling the Maxims, A and B definitely are not engaging in an interaction with “a common
purpose.” 

There are many other possibilities in which B may actively attempt to deceive A not only about his 
purpose but also about the gas station. One is that B knows that the nearest gas station is seven blocks away. 
B may point in the correct direction, saying “There is a garage round the corner,” flouting the Maxim of 
Quantity by not saying how far around the corner (A might balk and stay where he is if he knew how far). 
B may point in the wrong direction, saying “There is a garage round the corner,” flouting the Maxim of
Quality. B might point down the street and say, “There is a garage round the corner,” flouting the Maxim 
of Relevance because he knows it is an automotive repair shop only, and does not sell gasoline. 

In short, B’s truth-telling (or lack of intent to deceive) has nothing to do with whether he speaks in 
adherence with Grice’s Cooperative Principle, or whether he adheres to the Maxims. 


2.2. ANOTHER PURPORTED NON-VIOLATION. Grice presents the following example (2) as “a little less 
clear” in regard to how a speaker implicates that he is observing the Maxim of Relation.⁵


(2) A: Smith doesn’t seem to have a girlfriend these days. 
B: He has been paying a lot of visits to New York lately. 


Grice says that here “B implicates that Smith has, or may have, a girlfriend in New York” (1989:32). He 
declines to provide the conditions, but compares them to those for “There is a garage round the corner” in 
(1) — that the speaker must know the statement and implicature are both true or at least likely to be true. 

There are, however, other very likely possibilities. One is that Smith indeed has (or may have) a girlfriend 
in New York; if B knows that A knows that Smith met a woman from New York and was interested in her, then
Grice’s notion about B’s implicature holds. However, another possibility is that Smith has no time for a
girlfriend because he is spending too much time on business trips to New York; B might speak with this 
expectation of understanding on A’s part if B knows that A knows that Smith is a workaholic who travels to 
New York on business. In both cases, B is speaking cooperatively, yet the implicature about the existence of 
a girlfriend is vastly different (girlfriend / no girlfriend). Also, in both cases there is no apparent failure to 
fulfill a Maxim. B, on the other hand, may be (in Grice’s terms) opting out. Perhaps B knows Smith has a 
New York girlfriend, but it is A’s ex-girlfriend. Suppose B knows that A is very interested in Smith’s New
York business deals and wants to distract A from talking about a topic that would be potentially very 
distressing to A. In this case, A’s purpose or accepted direction in the conversation is to find out about 
Smith’s girlfriend, but B’s is precisely the opposite. 

B might similarly be opting out because he knows Smith has “come out” and has a boyfriend in New 


⁵ I use Grice’s term “implicate” rather than the more usual “imply” since I want to be clear that I am, for the sake of argument,
temporarily accepting his construct of implicating. The use does not indicate an endorsement of the construct or the term. 




York, knows that A has in the past been verbally abusive about homosexuals, and wants to avoid what he 
regards as a distasteful scene with A. 

On the other hand, suppose B knows that A and Smith are a gay couple. B further knows that A thinks 
that B thinks A and Smith are heterosexual (and thus not a couple). B knows that A worries that Smith is 
having an affair with a man in New York (but is not, as far as B knows). If B has an intense dislike for A and 
wishes to cause him distress, B says “He has been paying a lot of visits to New York lately” to cause A to 
believe Smith is being unfaithful. Neither A nor B is being cooperative in such a case. A is flouting the Maxim 
of Quality by referring to a girlfriend. B is flouting the Maxim of Relation by giving information
that he knows bears no true relation to what A has said, in a purposeful attempt to mislead A.

If A is a woman and she and Smith are married, a similar set of conditions could hold, with the 
implicature that the affair was with a man or woman, depending on what B knows about A’s understanding 
of Smith’s possible amour in New York. Now suppose B is expecting to undermine A’s marriage with Smith 
because B expects to find himself as A’s love interest when the relationship falls apart. In this case, A would 
be adhering to the Cooperative Principle, but B would not (again, flouting the Maxim of Relation so as to 
mislead). 

Thus, if we view the Maxims as Grice intends, that is, as describing norms of speaker behavior, the 
Maxims provide no way to explain why there will be different interpretations by A or why B will sometimes 
behave cooperatively and other times not. 


2.3. A PURPORTED VIOLATION DUE TO A CLASH. Grice (1989:32) gives the example of A planning
an itinerary for a vacation in France; B knows that A wants to visit their mutual friend C as long as it does 
not cause too much of a detour from his route (3). 


(3) A: Where does C live? 
B: Somewhere in the South of France. 


What B says is, according to Grice, an infringement of the Maxim of Quantity that results from B 
knowing that he lacks sufficient information to be of use to A (which would be a violation of the Maxim of 
Quality if he were to try to give more). This is what Grice calls a clash of Maxims, resulting in the failure to 
fulfill one of them. 

Suppose that B knows A’s time and resources are limited and that A’s other plans all involve locales 
along the northern coast of France. In this case, more detail is unneeded, because B knows A will not be 
able to go out of his way to the South of France. In this case, there is no failure to fulfill any Maxim. 

A asks B where C lives, we might assume, because B has closer ties to C than does A. Suppose B has had 
a falling out with C. B may be suggesting this to A by purposely giving the vague response “Somewhere in 
the South of France” in an affectedly offhand manner. In this case, we should not be surprised if A’s response 
to B were an awkward pause followed by something like “I thought you and C were close friends!” In this 
case, A’s understood direction for the conversation is to find out where C lives so as to plan his itinerary, 
but B’s understood direction is to seek sympathy for his falling out with their friend C. They do not share a 
mutual direction for the conversation. Further, in this case, B is violating (not flouting) the Maxim of 
Quantity to create the expected understanding in A.⁶

Perhaps on the other hand B does know C’s exact address, knows it is in the South of France, and also 
knows that even though A does not have time or resources to travel to the South of France, C would happily 


⁶ This is a violation, not flouting, of the Maxim of Quantity in the sense that B has to know that A knows that B knows where C
lives — or else B would expect A to understand B’s vagueness in some other way. 




travel to the northern coast to see his old friend A. But B does not want to give this information to A because 
he is jealous of A’s past closer friendship with C, so he implicates that he doesn’t have it (a deception). This 
would also involve flouting of the Maxim of Quantity. 


2.4. WHAT GRICE’S EXAMPLES REALLY SHOW. In short, to know whether or not there is what Grice 
would describe as violation, flouting, opting out, or clash of Maxims requires that we know what speakers 
know about each other and that we know what their agendas are. We cannot make a priori assumptions 
about the facts of the situation and project them onto speakers and hearers, and more importantly, neither 
can we assume that speakers are behaving cooperatively. There are simply too many everyday situations in 
which their agendas are not aligned. 

As Maturana & Varela (1992) point out, when two or more organisms are engaged in a “structural 
coupling” (75), the system of which they are a part can remain intact even as the organisms involved 
undergo very independent dynamic changes of state (75, 175-176, 193- 19). Further, as many have noted, 
the “conduit metaphor” of communication, in which “mental representations are poured into the conduit
and are extracted from it, without undergoing modifications,” simply contradicts what is known about the 
neurological basis of cognition (Balconi & Amenta, 2008:93). As a closer look at Grice’s own examples 
shows, violations of the four Maxims — purported cases of “uncooperativeness” — can actually result in 
maintaining or strengthening, rather than damaging, the stability of the social system in which the 
purported violator is engaged (see, for example, Reynolds et al., 2013; Smith et al., 2014). At the very least, 
violations frequently advance the agenda of the speaker within the mutually-shared environment. 


3. A PARTIAL HUMAN LINGUISTICS FORMALIZATION. I will take the “somewhere in the South of 
France” example (3) from Grice to show how consideration of different understandings and expectations
can be formalized in a Human Linguistics framework. Here three possible expanded interactions are
provided, (4) - (6), reflecting situations presented above.


(4) A: Where does C live?
B: Somewhere in the South of France.
A: Oh, I thought it was near Bretagne. Too bad.

(5) A: Where does C live?
B: Somewhere in the South of France.
A: I thought you and C were close friends!

(6) A: Where does C live?
B: Somewhere in the South of France.
A: I see what you mean. That is a bit far to ask him to go.


In (4), we can represent a property of [A] to indicate A’s understanding that C lives near Bretagne (in the
north) as [A]<[C]<lives near Bretagne>>. In Human Linguistics, models of systems (such as models of 
people, objects, and so on) are indicated with their names in square brackets [ ]. Properties of systems are 
indicated with values in pointed brackets < >. Thus [A]<[C]<lives near Bretagne>> indicates a model of a 
system, [A], and of a property of A, <[C]<lives near Bretagne>>; the latter represents A’s understanding of 
a system [C] as having the property <lives near Bretagne>. This type of property, representing one person’s 
understanding, is referred to as an orthoconcept (Yngve 2006, 2007; Coleman & Sypniewski, 2017). This is 
A’s initial understanding of C when the conversation begins. B’s understanding is quite different and can




be represented as [B]<[C]<lives in the South of France>>. When B tells A “Somewhere in the South of 
France,” this changes A’s understanding to match B’s, that is, [A]<[C]<lives in the South of France>>, which 
in turn implies that A’s prior understanding [A]<[C]<lives near Bretagne>> is negated as [A]<[C]<~lives 
near Bretagne>> (where the “~” is Boolean negation, so “<~lives near Bretagne>” can be read as “does not
live near Bretagne”). We can further model why B speaks with an expectation that [A] will not travel to visit 
their friend C. If B understands that A will be staying in Bretagne [B]<[A]<will visit Bretagne>> and does 
not have time to visit the South of France [B]<[A]<cannot go far from Bretagne>>, then he can easily have 
the expectation that “Somewhere in the South of France” is as much information as A needs in order to 
know that he cannot take a side-trip to visit C. 
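
The bracket notation used above lends itself to a small recursive data structure. The following Python sketch is offered only as an illustration and is not part of Yngve's formalism: the names `Model`, `render`, and `negate` are hypothetical, and the sketch assumes that a property is either a plain description or a further model of a system nested inside it.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Model:
    """A model of a system, e.g. [A], carrying one property.

    The property is either a plain description (a string) or another
    Model, which is how one person's understanding of another system
    is nested. These names are illustrative assumptions only.
    """
    name: str
    prop: Union["Model", str]


def render(m: Model) -> str:
    """Write a model in the bracket notation: [name]<property>."""
    inner = m.prop if isinstance(m.prop, str) else render(m.prop)
    return f"[{m.name}]<{inner}>"


def negate(m: Model) -> Model:
    """Apply Boolean negation (~) to the innermost property."""
    if isinstance(m.prop, str):
        flipped = m.prop[1:] if m.prop.startswith("~") else "~" + m.prop
        return Model(m.name, flipped)
    return Model(m.name, negate(m.prop))


# A's initial understanding in (4), and its negation after B speaks.
a_before = Model("A", Model("C", "lives near Bretagne"))
print(render(a_before))          # [A]<[C]<lives near Bretagne>>
print(render(negate(a_before)))  # [A]<[C]<~lives near Bretagne>>
```

Nesting one `Model` inside another mirrors the way an orthoconcept is a property of the person who holds the understanding, not of the system being understood.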

In (5), we have a more complex situation, one that involves one speaker’s understanding of another 
speaker’s understanding. This is the so-called “Theory of Mind”⁷ of Premack (see Premack & Woodruff
1978); for a broader overview of how Theory of Mind is dealt with in Human Linguistics, see Coleman 
(2011). In (5), A understands that C lives in the South of France [A]<[C]<lives in the South of France>> 
and understands that B, who is closer to C, not only shares the understanding [A]<[B]<[C]<lives in the 
South of France>>>, but also understands that B almost certainly has C’s exact address
[A]<[B]<[C]<lives at address X in the South of France>>>.⁸ Thus, when B responds “Somewhere in the
South of France,” A interprets what B says as unwillingness to provide the address. A knows that this 
unwillingness on B’s part could arise from a number of causes. In order for B’s speech to be a violation of 
the Maxim of Quantity rather than a case of B flouting the same Maxim, A must understand that B is not 
being deceptive, that is, [A]<[B]<~deceiving A>>. Perhaps A’s interpretation is affected by a pained 
expression on B’s face when he says, “Somewhere in the South of France.” This may cause A to have the 
understanding we could represent as [A]<[B]<is uncomfortable talking about C>>. Since A understands 
that B and C have been very close friends, [A]<[B]<[C]<close friend>>>, A infers a falling out between B and
C, and says with surprise, “I thought you and C were close friends!” This is not really a question about whether
they are friends, but one about the state of that friendship.

In (6), we have conditions similar to those of (5). A understands that C lives in the South of France 
[A]<[C]<lives in the South of France>>. A understands that B shares this understanding [A]<[B]<[C]<lives 
in the South of France>>>. A understands that B almost certainly has C’s exact address
[A]<[B]<[C]<lives at address X in the South of France>>>. A understands that B is not being deceptive 
[A]<[B]<~deceiving A>>. A understands (initially) that C might want to come up from the South to 
Bretagne, where A will be staying [A]<[C]<willing to travel to Bretagne to visit>>. However, when B says, 
“Somewhere in the South of France” (which they both already know to be the case), A takes B’s speech as 
being about the distance, not as a direct response to his request for C’s contact information. A then has the 
understanding that B understands C would not be willing to travel [A]<[B]<[C]<~willing to travel to 
Bretagne to visit>>> or perhaps that B understands that C would be willing to travel [A]<[B]<[C]<willing 
to travel to Bretagne to visit>>> but that B has the opinion that A would be inconveniencing C too much by 
suggesting he make the trip [A]<[B]<[A]<inconveniencing C too much>>>. When A says, “I see what you 


⁷ I use the term because it is the common way of referring to such understandings. However, it comes with a warning that the term
“Theory of Mind” is potentially misleading. First, its reference to “theory” may imply for some a mental process of theorizing similar
to what is done by a scientist-as-observer, which is not at all intended here. Also, the term “mind” is fraught with potential for
domain confusion arising from unwarranted assumptions of mind-body dualism; Human Linguistics explicitly rejects dualism, as 
it is the cause of problematic domain confusions. 

⁸ I am aware that the notation can become a bit like Who’s on First? Read [A]<[B]<[C]<lives in the South of France>>> as “A has an
understanding that B has an understanding that C lives in the South of France.” Read [A]<[B]<[C]<lives at address X in the South of
France>>> as “A has an understanding that B has an understanding that C lives at some specific address X in the South of France.”




mean. That is a bit far to ask him to go,” he is backing off from what he now understands B to think is an 
onerous burden for A to place on C, and that C might even refuse. 


4. HOW DIFFERENCES IN EXPECTATIONS PLAY THEIR ROLE. In each case, I have briefly outlined 
the differences in the contextual conditions that distinguish each situation in (4) - (6), above. Now I will 
compare the four situations in more depth by presenting a Human Linguistics model of how B's 
expectations differ in (4) - (6) depending not only on his understanding of the situation, but also including 
his understanding of A’s understanding of things. 


(7) (a) <[A]<will visit Bretagne>> x 
<[A]<[C]<lives near Bretagne>>> x 
<[A]<cannot go far from Bretagne>> x 
<[C]<lives in the South of France>> x 
<emit sound of “Somewhere in the South of France”> :: 
<expect / <[A]<[C]<~lives near Bretagne>>>> 
(b) <[A]<emit sound of “Oh, I thought it was near Bretagne.”>> :: 
<~expect / <[A]<[C]<~lives near Bretagne>>>> 
(c) <expect / <[A]<[C]<~lives near Bretagne>>>> : <del-exp>, At] 
(d) <~expect / <[A]<[C]<~lives near Bretagne>>>> x <del-exp> ::
<[A]<[C]<~lives near Bretagne>>>


The expectation procedure in (7) describes B’s expectation of how his saying “Somewhere in the South 
of France” will affect A in the case of (4) above (Yngve, 1996:263). The first part, (7a), is a setting procedure 
(Yngve, 1996:163-165); it states the background conditions which establish B’s expectation of an effect on 
A — they are in boldface in the first four lines of (7a). The “x” at the end of each line indicates Boolean 
logical AND. The four conditions mentioned plus B’s actually saying “Somewhere in the South of France” 
together trigger (set) B’s expectation of a change in the properties of A, specifically, that A will go from 
understanding that C lives near Bretagne to understanding that C does not live near Bretagne: <expect / 
<[A]<[C]<~lives near Bretagne>>>>. Part (7b) states that when B hears A say “Oh, I thought it was near
Bretagne,” he no longer expects to change A’s understanding (B realizes it has just changed). Parts (7c) and
(7d) consist of logical mechanisms to describe how the fulfillment of the expectation leads to the appropriate 
output of the expectation procedure. In this case, that output is the change in B’s understanding of A’s 
understanding. One of the initial conditions was that B understood A to understand that C lives near 
Bretagne (see <[A]<[C]<lives near Bretagne>>> in [7a]); the final output is B’s revised understanding of A’s 
understanding, that C does not live near Bretagne (see <[A]<[C]<~lives near Bretagne>>> in [7d]).’ 
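
The logic of (7) can also be sketched as a small program. The Python fragment below is only an illustration under stated assumptions and is not Yngve's formalism: the class name ExpectationProcedure, its method names, and the encoding of orthoconcepts as strings are all hypothetical. It shows the setting behavior described above: once the background conditions and the utterance hold together, the expectation stays set until A's reply, at which point B's understanding of A's understanding is revised and remains revised.

```python
from dataclasses import dataclass, field


@dataclass
class ExpectationProcedure:
    """Toy model of an expectation procedure as properties of B (hypothetical names)."""

    conditions: dict[str, bool]  # background conditions, as in the first lines of (7a)
    old: str                     # what B takes A to understand beforehand
    new: str                     # what B expects A to come to understand
    understandings: set[str] = field(default_factory=set)
    expecting: bool = False      # stays set until the expectation is fulfilled

    def emit_sound(self) -> None:
        # (7a): the conditions AND the utterance together set the expectation.
        if all(self.conditions.values()):
            self.expecting = True

    def hear_reply(self) -> None:
        # (7b)-(7d): A's reply ends the expectation and, because it had been
        # set, B's understanding of A is revised and stays revised.
        if self.expecting:
            self.expecting = False
            self.understandings.discard(self.old)
            self.understandings.add(self.new)


def example_7() -> ExpectationProcedure:
    """Build the situation in (7); the string labels are copied from (7a)."""
    old = "[A]<[C]<lives near Bretagne>>"
    return ExpectationProcedure(
        conditions={
            "[A]<will visit Bretagne>": True,
            old: True,
            "[A]<cannot go far from Bretagne>": True,
            "[C]<lives in the South of France>": True,
        },
        old=old,
        new="[A]<[C]<~lives near Bretagne>>",
        understandings={old},
    )
```

Calling example_7() and then emit_sound() and hear_reply() in that order leaves the revised understanding in place. If any background condition is false, emit_sound() sets no expectation and a later reply changes nothing, which is the sense in which the conditions are joined by Boolean AND.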


’ It is conventional in Yngve (1996:263) for the last line of an expectation procedure to consist of a control procedure (indicated
with a single colon “:”), which sends a momentary Boolean true “pulse” as the output of the completed expectation procedure.
However, this convention is present in Yngve (1996) because Yngve’s examples assume that the expectation procedure is to be
followed by another procedural property (a task procedure, a selection procedure, or an expectation procedure), which is initiated
by a Boolean true “pulse.” In the present case, the expectation is fulfilled by a long-term change in B’s understanding. A pulse output
from a control procedure would set the orthoconcept to true but then would immediately reset it to false, which would not accurately
describe what is going on. The orthoconcept <[A]<[B]<A inconveniencing C too much>>> must be triggered (set) by a setting
procedure (indicated by a double colon “::”) so as to stay Boolean true until explicitly revised.


(8) (a) <[A]<will visit Bretagne>> x 
<[A]<[B]<[C]<close friend>>>> x 
<[A]<[C]<lives in the South of France>>> x 
<[C]<lives in the South of France>> x 
<[C]<lives at address X in the South of France>> x 
<has had falling out with C> x
<emit sound of “Somewhere in the South of France”> ::
<expect / <[A]<B and C have had a falling out>>>
(b) <[A]<emit sound of “I thought you and C were good friends!”>> ::
<~expect / <[A]<B and C have had a falling out>>>
(c) <expect / <[A]<B and C have had a falling out>>> : <del-exp>, At]
(d) <~expect / <[A]<B and C have had a falling out>>> x <del-exp> ::
<[A]<B and C have had a falling out>>


In (5), we have the conditions that are repeated as properties of [B] in the expectation procedure shown in 
the first six lines of [8a], again in boldface. This time, however, we describe a very different expected 
outcome. In this situation, [B] expects to revise A’s understanding so that A will come to understand that 
B has had a falling out with C — see <expect / <[A]<B and C have had a falling out>>> in the last line of 
(8a). While the mechanism of the expectation procedure in (8) is very similar to that in (7), the outcome of 
these different conditions is that B’s understanding of A’s understanding has been revised. When B hears A 
say “I thought you and C were good friends!” B’s expectation is fulfilled; B now understands that A 
understands B has had a falling out with C. 


(9) (a) <[A]<will visit Bretagne>> x 
<[A]<[C]<lives in the South of France>>> x 
<[C]<lives in the South of France>> x 
<A inconveniencing C too much> x 
<emit sound of “Somewhere in the South of France”> :: 
<expect / [A]<[B]<A inconveniencing C too much>>>
(b) <[A]<emit sound of “That is a bit far to ask him to go.”>>:: 
<~expect / [A]<[B]<A inconveniencing C too much>>> 
(c) <expect / [A]<[B]<A inconveniencing C too much>>> : <del-exp>, Atl 
(d) <~expect / [A]<[B]<A inconveniencing C too much>>> x <del-exp> ::
<[A]<[B]<A inconveniencing C too much>>>

In (6), the case in which B has the opinion that for A to ask C to travel to Bretagne would be
inconveniencing C too much, we have the conditions shown in the first four lines of (9a). If all of these are
part of B’s understanding, then when B says “Somewhere in the South of France,” it triggers the expectation
<expect / [A]<[B]<A inconveniencing C too much>>>, which can be read as “A understands that B understands
that A would be inconveniencing C too much to ask C to make the trip to Bretagne.” When B hears A say,
“That is a bit far to ask him to go,” B understands that A’s understanding has been affected; that is, the
expectation has been fulfilled.


CONCLUDING REMARKS. Grice’s maxims cannot be taken as maxims, per se. They do not have a rule-like
quality. Speakers can use their understanding of what a hearer understands in order to try to change the
hearer’s understanding. As the examples above show, this includes the speaker’s understanding of the 
hearer’s view of the speaker as a trustworthy or untrustworthy source. The speaker can attempt to make the 
hearer’s understanding match their own or can attempt to make the hearer’s understanding otherwise (in 
the latter case, the speaker is attempting deception). The speaker does not do this by the transmission of 
propositional information, but by speaking to change properties in the hearer, in the process exploiting 
hearer expectations. This paper has applied a Human Linguistics model of orthoconcepts (Yngve 2006,
2007) to show that we can deal in more practical terms with appropriate background conditions that
Grice largely glosses over as if barely relevant. By referring to speaker expectations of the effect on a hearer’s 
understanding, I have shown how it is possible to achieve a unified account of those cases in which Grice 
claims Maxims are being fulfilled and those in which he claims his Maxims are left unfulfilled. Grice’s claims 
about violation, flouting, etc., of Maxims have been cited in the examples above; however, the HL model 
does not need to make normative claims about human behavior (the Cooperative Principle), nor does it 
need to establish a rule system (Grice’s Maxims) with built-in allowances for violating, flouting, opting out, 
etc. of the rule system. It instead provides mechanisms for analyzing what people say and how their speech is
interpreted as a result of understandings that arise from the people being situated in a real-world context.


REFERENCES 


BALCONI, M., & S. AMENTA. 2010. From pragmatics to neuropragmatics. In The neuropsychology of
communication, ed. by M. Balconi, 93-109. New York: Springer Verlag.

COLEMAN, D. W. 2011. Theory of Mind in Hard-Science Linguistics. LACUS forum 36:91-99. 

COLEMAN, D. W., & B. SYPNIEWSKI. 2017. Orthoconcepts: How the properties they represent arise. 
LACUS forum 37:261-268. 

DAWKINS, R. 1989. The selfish gene. Oxford, UK: Oxford University Press. 

GRICE, P. 1989. Studies in the way of words. Cambridge, MA: Harvard University Press.

LEVINSON, S. C. 2000. Presumptive meanings: The theory of generalized conversational implicature. Cambridge, MA:
MIT Press.

PREMACK, D., & G. WOODRUFF. 1978. Does the chimpanzee have a “theory of mind”? Behavioral and 
brain sciences 4:515-26. 

REYNOLDS, L., M. E. SMITH, J. BIRNHOLTZ, & J. HANCOCK. 2013. Butler lies from both sides:
Actions and perceptions of unavailability management in texting. Paper presented at CSCW ’13,
February 23-27, 2013, San Antonio, TX.

SMITH, M. E., J. BIRNHOLTZ, L. REYNOLDS, & J. HANCOCK. 2014. Everyday deception or a few prolific 
liars? The prevalence of lies in text messaging. Computers in human behavior 41: 220-227. 

TRIVERS, R. L. 1971. The evolution of reciprocal altruism. Quarterly review of biology, 46:35-57. 

YNGVE, V. H. 1996. From grammar to science: New foundations for general linguistics. Philadelphia: John
Benjamins.

---. 2006. Formalizing the observer in hard-science linguistics. LACUS forum 32:267-276.
---. 2007. People, orthoconcepts, and dialog. LACUS forum 34:313-319.


This article was first published at lacus.weebly.com. 





LACUS Forum 41:3.23-33 (2020) 







LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free of
charge and with no implied warranty. Its contents are available to you under the terms of the Creative
Commons Attribution-NonCommercial license version 3.0;
see http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 





THE SYNTAX-PROSODY INTERFACE: A CASE STUDY OF ENUMERATIONS 
SVETLANA KAMINSKAIA 
University of Waterloo 


Abstract: The case study presented here focuses on the prosodic properties of enumerations produced by a 
female speaker of French. The purpose of this descriptive analysis based on spontaneous speech production 
is to determine whether this syntactic structure of enumerations has its own prosody or whether it shares 
characteristics with continuations. A detailed comparison of the rhythmic and intonational properties of 
enumeration and continuation phrases suggests that the main differences are related to F0 (pitch) realization,
namely the alignment of the beginning and end of the final rise and its slope: the valley and the peak are realized
later in enumerations, and the rise has a less steep slope. These phonetic differences can potentially be
expressed by different tonal specifications of the two types of phrases.


Keywords: prosody, enumeration, continuation 


SPONTANEOUS SPEECH is inherently variable and therefore presents a major challenge for prosodic 
analysis. Frequent disruptions, hesitations, false starts, and incomplete utterances do not always allow a 
researcher to identify prosodic units without hesitation. Additionally, when they are identified, these units 
are difficult to compare, because identical structures are extremely rare in this type of production. Moreover, 
each speaker has their own voice quality and rate, which reduces the possibility of a direct comparison of 
prosody. In this context, enumerations appear to be perfect candidates for analysis: they are immediately 
recognizable (Fonagy & Magdics 1967, Lyudovyk & Robeiko 2007) and directly comparable (Pirker & 
Kramer 1999). However, do we recognize them because of a repeated prosodic pattern accompanying a 
repeated syntactic structure, or do they possess prosodic features that distinguish them from a regular 
utterance-medial (continuation) phrase? With two competing views, one suggesting that enumerations 
represent a specific stylized contour (Di Cristo 1998, among others) and “offer prosodic shapes where 
intonational components are easy to isolate” (Guaitella 1999: 156), and the other demonstrating with read 
data that they are variants of continuations (Portes et al. 2007, among others), it is important to explore this 
question not only to increase our knowledge about the syntax-prosody interface but also for speech synthesis 
and the prosody of spontaneous speech in general (Guaitella 1999). 

To explore the prosodic properties of enumerations, I examined recordings of a female native French 
speaker from the Troyes region in France and compared her production of enumerations with that of 
continuations. I examined the rate, syllable and phrase durations and a number of melodic aspects 
pertaining to the realization of the final rise, which in French corresponds to the primary accent: the 
alignment of the beginning and of the end of the rise, its scope, and the slopes. Statistical evaluation of the 
differences observed was not performed because of the limited set of tokens compared. I am aware of the 
limits of this case study, but if identified, the differences observed in the realization of the enumerations vs.
continuations of one speaker will contribute to the discussion of the prosody-syntax interface and
intonational grammar in French.


1. ABOUT ENUMERATIONS. Enumerations, or lists/listings, are often referred to as melodic clichés or 
stylized contours (Ladd 1978, 2008; Portes & Bertrand 2006). Other examples of melodic clichés are 
vocatives and teasing patterns (nananéres). These are characterized by a high degree of melodicity and 
syllable pitch stability (Di Cristo 1998: 215). Therefore, according to the research on clichés, enumeration 
intonation exists. Moreover, it can be applied to a text that does not have the morpho-syntactic or semantic 
properties of an enumeration, thus giving it a listing interpretation. In this case, it adds to the text of the 
message that ‘there is nothing new’, ‘this is redundant’, ‘this is how things usually go’:


(1) Bon bien sûr ↗ / c’est un film ↗ extraordinaire ↗ / c’est une documentation ↗ / puis ↗ on
fait une interpretation ↗ ... (adapted from Fonagy & Fonagy 1983)


The role of intonation in (1) is to attribute the same importance to different parts of the utterance. The 
melody here belongs to the pragmatics rather than the grammar and has its own significance. Fonagy 
(1998:343) suggested the opposite: there is not a specific enumeration pattern, but an enumeration 
interpretation is instead achieved due to “a succession of parallel tones (rising or falling) on the terms, 
whether words, phrases or clauses”. Coustenoble & Armstrong (1934) earlier proposed the same, suggesting 
that it is a repetition of the same intonational contours, no matter their shape, that is responsible for the
interpretation of a list series. Fonagy (2003:3) discussed enumeration intonation as a return of a specific 
melodic motive that is concomitant and becomes distinctive only in the cases illustrated in (1). 

Concerning the intonation of enumerations, in many languages, this is described as a succession of 
rising tones (for middle members of an open list) and a final fall that concludes the series. Consider, for 
example, English (2a) and Byelorussian (2b): 


(2) a. apples ↗, bananas ↗, peaches ↗, oranges ↗ and lemons ↘
b. маму ↗, тату ↗, брата ↗, бабулю ↗ та дзядулю ↘
mom-ACC, dad-ACC, brother-ACC, nana-ACC, and grandpa-ACC


In French, the default enumeration pattern, “the one that conveys the bare meaning of listing, with no 
additional semantic or pragmatic charges or nuances” (Estebas i Vilaplana 1999: 647), is also a rising 
contour. However, it is often referred to as a continuation contour, the one that is found on a phrase in the 
middle of a long declarative sentence. For example, Faure et al. (1976) suggested that a declarative sentence 
pattern can apply for an enumerative sentence, with rising contours accompanying all but the final list item, 
which is pronounced with a falling intonation. Because not all rising tones in a declarative sentence are 
equal due to syntactic and prosodic hierarchy, some phoneticians specify that for an enumeration series, all 
rises must attain the same level and pauses may be added to demarcate each item from the next. This gives 
the items equality and individuality: 


These statements consist of a series of rises in pitch, all of which attain approximately the same 
level. The overall effect is that all items mentioned appear to be of equal importance. After each 
continuation pattern, there is a rather short but very important pause which serves to demarcate 
each individual item from the next. In this way, items maintain their individuality as well as their
equality with respect to one another. (Lian 1980: 53) 


Figure 1. Chez le fruitier (B) on trouve des pommes (D), des oranges (F), des bananes (H),
des fraises (J) et des abricots (L). (taken from Lian 1980: 55)







At the same time, Portes et al. (2007:156) suggested that “some continuation rises ‘sound different’ when 
occurring on members of a list,” whereas Portes and Bertrand (2006:10) stated that enumerations “appear 
to be a very specific kind of non-final contour, different from usual continuations.” 

The default enumeration pattern in French is also referred to as an unmarked or ordinary enumeration,
which is devoid of vivacity and emotion. This is illustrated in Figure 2A, where the final syllable of an
enumeration item is realized with a higher pitch (or F0, fundamental frequency) than the preceding ones,
but the overall rise is slight and is realized as a “flat” contour. Figure 2B, conversely, illustrates an
emotionally marked enumeration, where the final rise is realized as a rising curve, reaching higher F0 values
than in Figure 2A.

Other melodic patterns found in French enumerations are the pedagogical, or journalistic style (Lucci 
1979) and a tune of minor enumeration (Stocker 1924). The first is characterized by a melodic peak on the 
first syllable (Figure 3), and the second has “a marked rise of the voice on an unaccented syllable, i.e. the 
penultimate of each group” (Stocker 1924: 109), which is followed by a melodic fall (3). 


Figure 2. Unemotional (A) and emotionally marked (B) enumeration in French.

Figure 3. Example of a pedagogical-style enumeration.


(3) Il y a des gens comme Soi, COMMS ma fshinis-comme moi. 


(adapted from Stocker 1924:111) 


The last two patterns are obviously different and stand out as phonostylistic ones that require a specific
context to occur. The default rising enumeration contours are more frequent, and they are the focus of the
current analysis. Its main question, whether enumerations have prosodic properties of their own, is answered by
comparing emotionally marked and unmarked enumerations both with each other and with
emotionally marked and unmarked continuations, respectively. Both expressive types are included because
of the spontaneous nature of the data analyzed, which are rarely emotionally neutral.


2. METHODOLOGY. The data under study come from a 27-year-old French female residing near
Troyes in the southern part of the Champagne-Ardenne region. The recording analyzed is a conversation
among the participant, a close friend, and the interviewer; it was conducted in 2003. The duration of
the interview is 1202 sec, and the total of the speaker's participation in this interaction is 316.35 sec. She
produced 530 stress groups, among which 359 have non-final rising contours. Twenty-two of them were 
enumerations: 18 were emotionally marked, and 4 were emotionally unmarked. These were compared with 
38 randomly selected continuation contours, 19 of each type of expressivity. 

The syntactic structure of enumerations varied: nominal phrases (e.g., “mairie, église”), prepositional 
phrases (e.g., “chez la belle-mère, chez la mère”), and adverbial phrases (e.g., “plein de choses, plein de
magasins”), among others.

The acoustic analysis was performed using the Praat software (Boersma & Weenink 2011). First, the 
signal was phrased into stress groups (or Accentual Phrases, APs [Jun & Fougeron 2000, 2002]), the minimal 
prosodic units within which a melodic contour is realized. Then, the selected enumerations and 
continuations were segmented into syllables and phones, and the following landmarks were tagged: AP 
boundaries, final stressed syllable boundaries, final stressed vowel boundaries, the beginning of the final 
rise (L), and the end of the final rise, i.e., the melodic peak (H). The F0 and time values corresponding to these
tags were extracted, and the following calculations were performed: 


- Duration of each AP;
- Rate: based on the number of syllables and the duration of each AP; the articulation rate was calculated as the number of syllables uttered per second;
- Stressed syllable duration;
- Proportion of the duration of the final syllable with respect to the duration of the AP;
- Rise time between L and H;
- F0 excursion (difference between F0 values of H tone and L tone);
- LH slopes (F0 excursion divided by the rise time);
- Alignment of L tone with respect to the beginning of the stressed vowel (time distance between the L tone and the vowel start);
- Alignment of H tone with respect to the end of the stressed vowel (time distance between vowel end and the H tone).
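
The calculations listed above can be sketched in a few lines of Python. This is a minimal illustration and not the script used for the study: the names APToken and measures, the field names, and the example values are hypothetical, and the two alignment measures are assumed to be simple differences (tone time minus vowel landmark time), so that a positive value means the tone falls after the landmark.

```python
from dataclasses import dataclass


@dataclass
class APToken:
    """Tagged landmarks for one Accentual Phrase (times in sec, F0 in Hz)."""

    ap_start: float
    ap_end: float
    n_syllables: int
    syll_start: float   # final stressed syllable boundaries
    syll_end: float
    vowel_start: float  # final stressed vowel boundaries
    vowel_end: float
    l_time: float       # beginning of the final rise (L)
    l_f0: float
    h_time: float       # end of the final rise, the melodic peak (H)
    h_f0: float


def measures(t: APToken) -> dict[str, float]:
    """Compute the duration, rate, rise, and alignment measures for one AP."""
    ap_duration = t.ap_end - t.ap_start
    syll_duration = t.syll_end - t.syll_start
    rise_time = t.h_time - t.l_time
    if ap_duration <= 0 or rise_time <= 0:
        raise ValueError("AP duration and rise time must be positive")
    excursion = t.h_f0 - t.l_f0
    return {
        "ap_duration": ap_duration,
        "rate": t.n_syllables / ap_duration,               # syll/sec
        "syllable_duration": syll_duration,
        "final_proportion": 100 * syll_duration / ap_duration,  # % of AP
        "rise_time": rise_time,
        "excursion": excursion,                            # Hz
        "slope": excursion / rise_time,                    # Hz/sec
        "l_vstart": t.l_time - t.vowel_start,              # assumed sign convention
        "h_vend": t.h_time - t.vowel_end,                  # assumed sign convention
    }


# Invented values for illustration only; they are not taken from the recording.
example = APToken(
    ap_start=0.0, ap_end=1.0, n_syllables=4,
    syll_start=0.6, syll_end=1.0,
    vowel_start=0.7, vowel_end=0.95,
    l_time=0.75, l_f0=180.0,
    h_time=1.0, h_f0=230.0,
)
result = measures(example)
```

Keeping all landmarks of one token in a single record makes it straightforward to compute every measure per token first and to summarize by phrase type afterwards.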




The alignment of L and H was calculated with respect to different landmarks - the beginning and the end of 
stressed vowels, respectively — to minimize variability, because the L tone is realized near the beginning of 
the vowel and the H is realized near its end. 

The interest in considering tonal alignment is motivated by the fact that this phonetic parameter
has demonstrated discriminatory power with respect to languages, dialects, and pragmatic and discursive contexts.
For example, F0 peak alignment allows for differentiation between declarative and interrogative utterances
in Neapolitan Italian (D’Imperio 2000, 2002), among dialects of English, Dutch, German, French, and
others (Ladd et al. 2009), between pragmatic meanings of uncertainty and assertion (Pierrehumbert & Steele
1989), and between nuclear and prenuclear stresses (Ladd et al. 2009, among others). To my knowledge,
this phonetic aspect of F0 realization has not been explored in studies addressing enumerations. I
hypothesize that this very sensitive prosodic parameter can be helpful for a comparison of enumerations
with continuations.


3. RESULTS. The results of the analysis are summarized in Table 1. With respect to rate, one notes
somewhat of a continuum between marked enumerations, which have the slowest rate (3.26 syll/sec), and
unmarked continuations, which have the fastest rate (5.99 syll/sec). Overall, the enumerations had a slower rate
than the continuations (3.26 and 3.96 syll/sec vs. 3.82 and 5.99 syll/sec). Additionally, the emotionally marked
phrases were uttered with a slower rate than were the emotionally unmarked: 3.26 and 3.82 syll/sec vs. 3.96
and 5.99 syll/sec, for enumerations and continuations in each pair, respectively. However, no specific
pattern appears because the distributions of rate values overlap (Figure 4).

Table 1. Results of the analysis.

Rate, syll/sec | % of final | Rise time, sec | Excursion, Hz | Slope, Hz/sec | L-Vstart, sec | H-Vend, sec

enumerations | 0.90 | 15.65 | 0.16 | 58.24 | 134.78 | 0.07 
enumerations 


| 0.06 

continuations | 0.99 | 13.54 | 0.09 | 68.40 | 366.60 | 0.04 
Unmarked [$9914.22 | _out_1_6.22__1_47642_/ 0.04 
continuations | 1.27 | 17.80 | 0.09 | 50.62 | 346.54 | 0.06 








Figure 4. Rate (syll/sec) by the phrase type: 1 = marked enumerations; 2 = unmarked enumerations;
3 = marked continuations; 4 = unmarked continuations.


Because the structure and duration of the analyzed phrases varied, I report here only the results for the 
proportion of the duration of the final syllable with respect to the duration of the phrase (Table 1). It is not 
surprising to find higher values, and thus longer syllables, for emotionally coloured phrases (61.37% and 
51.63% for enumerations and continuations) than for unemotional phrases (46.33% and 42.22% for 
enumerations and continuations). Emotionally marked enumerations have the highest value, and the 
unmarked continuations have the lowest. These results again form a sort of a continuum that is a mirror 
image of the one observed for rate. This is only natural because the duration of the sounds is negatively 
correlated with the speaking rate. 

Examine the distribution of the values of the proportion of the last syllable (Figure 5). The emotionally 
marked enumerations and unmarked continuations stand out as not overlapping. The emotionally 
unmarked enumerations and marked continuations occupy the same space and show similar medians. 





Figure 5. Proportion (%) of the duration of the stressed syllable relative to the AP, by phrase type: 1 = marked
enumerations; 2 = unmarked enumerations; 3 = marked continuations; 4 = unmarked continuations.


The intervals between the start and the end of the final rise (the rise time) vary greatly: both types of
enumeration show the largest differences in terms of the mean (Table 1) and distribution (Figure 6): 0.35
sec (marked) and 0.02 sec (unmarked). The rise time in continuations showed closer values: 0.21 sec and
0.14 sec, with overlapping distributions (Figure 6).





Figure 6. Rise time (sec) L to H, by the phrase type: 1 = marked enumerations; 2 = unmarked
enumerations; 3 = marked continuations; 4 = unmarked continuations.


The mean values of excursion, or the sizes of the rise from L to H, also vary considerably. However,
overall, the excursion is larger in continuations than in enumerations, with the largest value found in
emotionally marked continuations (129.62 Hz); unmarked enumerations had almost no rise within a vowel
(4.98 Hz) (Table 1, Figure 7).


Figure 7. F0 excursion (Hz) L to H, by the phrase type: 1 = marked enumerations;
2 = unmarked enumerations; 3 = marked continuations; 4 = unmarked continuations.


From the rise time and the excursion, slope values were calculated to reveal that the enumeration slopes 
are considerably less steep than the continuation slopes are: 286.95 Hz/sec and 275.56 Hz/sec vs. 680.64 
Hz/sec and 476.42 Hz/sec (Table 1). This difference is also accompanied by a minimal overlap in the 
distribution of the slope values for enumerations and continuations (Figure 8). 
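
When the slope figures are set beside the mean rise times and mean excursions, it helps to remember that a mean of per-token slopes generally differs from the ratio of the mean excursion to the mean rise time. The sketch below illustrates this with invented tokens; it assumes, without confirmation from the analysis itself, that slopes were computed token by token before averaging, and the function names are hypothetical.

```python
from statistics import mean

# Each token is (F0 excursion in Hz, rise time in sec); values are invented.
Token = tuple[float, float]


def mean_of_slopes(tokens: list[Token]) -> float:
    """Average the slope of each token (excursion / rise time)."""
    return mean(excursion / rise_time for excursion, rise_time in tokens)


def slope_of_means(tokens: list[Token]) -> float:
    """Divide the mean excursion by the mean rise time."""
    mean_excursion = mean(excursion for excursion, _ in tokens)
    mean_rise_time = mean(rise_time for _, rise_time in tokens)
    return mean_excursion / mean_rise_time


tokens = [(100.0, 0.1), (50.0, 0.25)]  # hypothetical steep and shallow rises
per_token = mean_of_slopes(tokens)
from_means = slope_of_means(tokens)
```

With these invented tokens the two summaries differ noticeably, because short, steep rises weigh heavily in the per-token average.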





Figure 8. LH slopes (Hz/sec), by the phrase type: 1 = marked enumerations;
2 = unmarked enumerations; 3 = marked continuations; 4 = unmarked continuations.


The latency of L with respect to the beginning of the stressed vowel was calculated as the time difference between
the moment of realization of the low target and the beginning of the vowel (L-Vstart) (Table 1). Thus,
positive values indicate that the rise starts after the vowel onset, and negative values indicate that the rise
begins before onset. In Table 1, only emotionally unmarked continuations have a negative value (-0.04 sec),
whereas the marked ones have a positive value (0.03 sec), as do both types of enumeration (0.05 sec and 0.06
sec). In the examination of the frequency of occurrence of the intervals (Figure 9), one notes that
enumerations have wider latencies (from 0 to 0.2 sec), whereas continuations are realized closer to the
vowel beginning (mainly between -0.05 and 0.05 sec). This difference is schematically represented in
Figure 10, where it appears more clearly that continuations have the beginning of the rise (L) aligned earlier
and in a more compact zone.


Figure 9. Alignment of L tone preceding the AP final rise relative to the beginning of the stressed vowel
(A. Marked enumerations; B. Unmarked enumerations; C. Marked continuations; D. Unmarked continuations).


Figure 10. Alignment of L with respect to the beginning of the stressed vowel (t = 0 sec).


Finally, the latency of H tone with respect to the vowel end was calculated as the time difference between 
the moment of realization of H and the end of the vowel (H-Vend) (Table 1). All positive values in this table 
suggest that in all types of phrase, the H tone appeared after the vowel offset. Thus, the alignment of the peak 
in unemotional enumerations is the farthest from vowel offset compared with the other three types of phrase 
(0.07 sec vs. 0.04 sec and 0.03 sec). The distribution of the latencies (Figure 11) is summarized in Figure 12, 
where the spread of the alignment of the peak appears wider in enumerations and very condensed in 
continuations, particularly in emotionally marked continuations. 


Figure 11. Alignment of H relative to the end of the stressed vowel
(A. Marked enumerations; B. Unmarked enumerations; C. Marked continuations; D. Unmarked continuations).

Figure 12. Alignment of H with respect to the end of the stressed vowel (t = 0 sec).


CONCLUSIONS. This case study explored the prosodic characteristics of enumerations by comparing 
them with continuations to determine whether they have specific prosodic properties. Due to the 
spontaneous nature of the data analyzed, I considered emotionally neutral and emotionally marked 
continuations and a series of enumerations and compared the following prosodic parameters: rate, 
proportion of the duration of the final stressed syllable with respect to the duration of the phrase, and the final
intonational rise (rise time, rise size, slope, and alignment of L and H tones). 

The results suggest that enumerations are uttered at a slower rate and that the proportion of their final 
syllable is larger in comparison with continuations. However, the F0 rise time did not reveal any tendency,
though both the valley (L) and the peak (H) are realized later in enumerative APs. The F0 excursion
appeared larger in continuations, which contributed to the differences observed for the slopes: they are less 
steep in enumerations. This observation confirms the findings of Portes et al. (2007:160-61), who also found 
that “[t]he pertinent ‘specific formal feature’ [of enumerations] is a smaller slope of the rise.” Based on this 
finding, the authors concluded that “the rising contour appearing in enumeration context [...] can be 
considered as a subclass of continuation rise ...” and that “the list rise corresponds to the stylised version of 
the continuation rise.” 

The alignment of both L and H targets also appears to contribute to the distinction between the two 
types of phrase compared. With respect to the beginning of the stressed vowel, the valley is realized later in 
enumerations; the peak in these phrases also appears later, though with respect to the end of the vowel. The 
alignment of the tones thus demonstrates a discursive value. The importance of these findings should be verified 
on a larger dataset and with perception tests, because if these results are confirmed, such a difference in 
tonal alignment should be reflected in tonal structure: LH* notation would describe the rise in a 
continuation contour, and L*H- would describe an enumeration rise (where “*” shows the tone primarily 
realized on the vowel, and “-” shows the tone aiming at the boundary following the phrase). Such notations 
are largely used in intonational phonology in different languages but are typically not used in French, where 
the shape of the rise is not known to contribute to the meaning. Different tonal specifications based on 
associations with different landmarks, whether segmental or phrasal, would reflect both prosodic and 
syntactic differences that are proper to continuations vs. enumerations. Studies aiming at a detailed 
examination of F0 with an eye toward pragmatic and discursive meaning can refine the intonational 
grammar of the French language. 


REFERENCES 


BOERSMA, P., & D. WEENINK. 2011. PRAAT: Doing phonetics by computer. Computer program. 
Retrieved from http://www.praat.org. 

COUSTENOBLE, H. N., & L. E. ARMSTRONG. 1934. Studies in French intonation. Cambridge: Heffer and 
Sons. 

D’IMPERIO, M. 2000. The Role of Perception in Defining Tonal Targets and their Alignment. PhD 
dissertation. Ohio State University. 

---. 2002. Language-specific and universal constraints on tonal alignment: The nature of targets and 
“anchors”. In Proceedings of Speech Prosody 2002, ed. by B. Bel and I. Marlien, 101-106. Aix-en- 
Provence: Laboratoire Parole et Langage. 

DI CRISTO, A. 1998. Intonation in French. In Intonation Systems: A survey of twenty languages, ed. by D. 
Hirst and A. Di Cristo, 195-218. Cambridge: Cambridge University Press. 




ESTEBAS I. VILAPLANA, E. 1999. The intonation of enumerations: A cross-linguistic study. In Actas des 
XXI Congresso Internacional de Asociacion Espanola de Estudios Anglo-Norteamericanos, ed. by F. T. 
Iglesia, J. A. P. Pablo, M. J. Mora, and T. L. Soto, 645-652. Sevilla: Universidad de Sevilla. 

FAURE, G., A. DI CRISTO, & P. DURAND. 1976. La phonétique corrective dans l’enseignement du 
français langue étrangère. Le français dans le monde 22: 27-43. 

FONAGY, I. 1998. Intonation in Hungarian. In Intonation Systems: A survey of twenty languages, ed. by D. 
Hirst and A. Di Cristo, 328-44. Cambridge: Cambridge University Press. 

---. 2003. Des fonctions de l’intonation : Essai de synthèse. Flambeau 29: 2-20. 

---, & J. FONAGY. 1983. L’intonation et l’organisation du discours. Bulletin de la Société de Linguistique de 
Paris 78 (1): 161-209. 

---, & K. MAGDICS. 1967. The intonation of Hungarian. Budapest: Akadémiai Kiadó. 

GUAITELLA, I. 1999. Réitération et énumérations : approches des fonctions discursives et interactives de 
l’intonation. Faits de langues 13: 150-56. 

JUN, S.-A., & S. FOUGERON. 2000. A phonological model of French intonation. In Intonation, ed. by A. 
Botinis, 209-242. Dordrecht: Kluwer Academic Publishers. 

--- & ---. 2002. Realizations of accentual phrase in French intonation. Probus 14: 147-72. 

LADD, R. 1978. Stylized intonation. Language 54: 517-540. 

---. 2008. Intonational phonology (Second edition). Cambridge: Cambridge University Press. 

---, A. SCHEPMAN, L. WHITE, L. M. QUARMBY, & R. STACKHOUSE. 2009. Structural and 
dialectal effects on pitch peak alignment in two varieties of British English. Journal of Phonetics 
37: 145-61. 

LIAN, ANDREW-PETER. 1980. Intonation patterns in French, teacher's book. Sydney: River Seine Publications. 

LUCCI, VINCENT. 1979. L’accent didactique. In L’accent en français contemporain, ed. by I. Fonagy and P. 
Léon. Studia Phonetica 15: 107-122. 

LYUDOVYK, T. & V. ROBEIKO. 2007. Inventory of intonation contours for text-to-speech synthesis. 
Proceedings of the 6th ISCA Workshop on Speech Synthesis, Bonn 2007, ed. by P. Wagner, J. Abresch, S. 
Breuer, and W. Hess, 368-373. Bonn: Universität Bonn. 

PIERREHUMBERT, J. B., & S. STEELE. 1989. Categories of tonal alignment in English. Phonetica 46: 181-96. 

PIRKER, H., & S. KRAMER. 1999. Listening to lists: Studying durational phenomena in enumerations. 
Proceedings of the 14th international conference of phonetic sciences, ed. by J. Ohala, 273-276. San Francisco: 
University of California. 

PORTES, C. & R. BERTRAND. 2006. Some cues about the interactional value of the “continuation” 
contour in French. In Proceedings of IDP05 (Interface-Discourse-Prosody), ed. by A. Di Cristo et al., 
8-9 October 2005, Aix-en-Provence, France. CD-ROM. 

---, R. BERTRAND, & R. ESPESSER. 2007. Contribution to a grammar of intonation in French: Form and 
function of three rising patterns. Nouveaux cahiers de linguistique française 28: 155-62. 

SCHEPMAN, A., R. LICKELY, & R. LADD. 2006. Effects of vowel length and « right context » on the 
alignment of Dutch nuclear accents. Journal of Phonetics 34: 1-28. 

STOCKER, C. 1924. The speech-tune of minor enumeration: A study in French tonetics. The modern 
language journal 9 (2): 107-114. 


This article was first published at lacus.weebly.com. 


LACUS Forum 41:3.34-42 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free 
of charge and with no implied warranty. Its contents are available to you under the terms of the 
Creative Commons Attribution-NonCommercial license version 3.0; see 
http://creativecommons.org/licenses/by-nc/3.0/. 

Copyright © 2020 the Linguistic Association of Canada and the United States. 





ANCIENT INDIAN GRAMMARS AS DOORWAYS TO FREEDOM 
KAREN/SRUTI MALCOLM 
University of Winnipeg 


Abstract: Although the ancient Indian grammarian Panini has been called India’s Euclid and his 
grammatical treatise the Astadhyayi “a monumental work of human intelligence” by Bloomfield (1933:11); 
although Dineen calls Panini the Indian grammarian whose description of Sanskrit revolutionized western 
linguistics in the nineteenth century; although Panini’s grammar is considered the most exhaustive and 
complete statement of the rules of word formation in any Indo-European language and is said to have 
influenced twentieth century linguists like Bloomfield, Saussure, Jakobson, Firth and all subsequent 
grammarians, there is seldom any mention of the communicative purpose of Panini’s grammar, or of the 
details of its social and cultural context that would ensure appropriate interpretation. Panini’s work as well 
as that of other ancient Indian grammarians has been decontextualized and depersonalized as a scientific and 
formal system of grammar in a way that is quite different from the way that it was intended. 


Keywords: Ancient Indian Grammarians, Panini, Bhartrhari, Abhinavagupta 


Languages: English, Sanskrit 


PANINI’S GRAMMAR WAS FIRST AND FOREMOST INTENDED as a prescriptive support for Vedic 
study, performance, and interpretation. In fact, all India’s early grammarians, from 5th century BC Panini 
to 11th century CE Abhinavagupta, wrote their great treatises and commentaries for this same reason. 
However, a knowledge of language and grammar was much more important than maintaining a golden 
mean of linguistic performance. It was considered necessary to further an individual’s quest for moksha 
‘liberation’. And symbolically, these early grammars were thought to model the nature of reality and creation, 
an expansive vision. In a commentary on Abhinavagupta’s phonemic emanation theory, Padoux (1990:xi) 
writes “Grammar is the gateway to salvation.” Somehow this very real sentiment, common to all ancient 
Indian grammarians, seems to have been lost in translation. In this paper, the ancient grammars of India are 
recontextualized in terms of the breadth and depth of their communicative intention and their literary and 
philosophical context. 

The more I understand the ancient world of the Vedic grammarians, the more I begin to realize the 
complexity of the philosophy which grounded them in all their scholarly pursuits. According to Biardeau, 
Indian thinkers always try to immerse the objects of their studies in a wider philosophy, a global view of the 
universe (Isayeva 1997:69). Their mythology bridges worlds of science, literature, metaphysics and culture. 
People of Vedic times were concerned with the nature of reality, the relationship between being, 
consciousness, and language (ontology) and a theory of knowledge that considered valid means of cognition 
and methods of logic and inference (epistemology). 

Almost everything said by the ancient grammarians needs to be interpreted on multiple levels: on the 
highest level concerning their belief in Absolute consciousness (sometimes personified theistically as 
Brahman or later, Siva); macrocosmically, on the level of the universe and everything within it, and 
microcosmically, on the level of the human body and human consciousness. Vedic scholars believed that 
the evolution of the universe and the evolution of human consciousness reflected Absolute consciousness, 
symbolically represented by the Sanskrit language. And these evolutionary pathways when reversed 
mapped the path of involution that guided the yogi ‘back’ through the distraction of the variegated universe 
to the unitary source place: the creative matrix from which it all began. The ancient Vedas of India say that 
Brahma gave the language of Sanskrit to Brhaspati, who gave each word to Indra over a period of a 
thousand years (Ray 2004:2, 32; Rath 2004:52). According to the Taittiriya Samhita, at this point vak 
‘speech’, was still avyakrta ‘undiscriminated’, a potential not yet manifested, so the gods asked Indra to 
differentiate it, language by language, word by word. Doing so, he was considered the first purifier of 
grammar. Hence, Vedic words were believed to come from the mouth of God (Rath 2004:44). Indra 
passed this knowledge on to the ancient rsis ‘seers’, who were responsible for seeing / hearing the words of 
the Vedas ‘revealed scriptures’. 

Because the Vedic scriptures were believed to come directly from God, their preservation was regarded 
as a religious duty (Banerji 1996:18). Some believed the Vedas had been composed in code, so every detail 
was meticulously preserved (Flood 1996:40). The oral tradition was passed down by sophisticated 
techniques of memorization: word by word, word groups, words recited in reverse, words involving 
complicated series of combinations, words remembered in sentences, words accompanied by hand gestures. 
Because linguistic knowledge and precision were so highly valued, it is not surprising that the Vedic world 
was a fertile ground for spawning grammars. Of the six vedangas ‘systems of knowledge supporting the 
Vedas’, four focused on language. In order to teach their students correct pronunciation, recited at 
exactly the prescribed pitch, Vedic teachers taught siksa ‘phonetics’ (precise and perfect pronunciation, 
syllabification, and stress), vyakarana ‘grammar’, nirukta ‘etymology’, chanda ‘prosody’, kalpa ‘the details of 
ritual’, and jyotisa ‘astronomy’ (the right time for rituals) (Rath 2004:49-50). 

The Atharva Veda speaks of twelve schools of Sanskrit grammar including over a thousand works on 
grammar by some three hundred grammarians (Ray 2004:7-8, cf. Banerji 1996). Grammar included the 
analysis of words into their constituent parts: base, suffix, augments and modifications (Nene 2004:63). 
Grammarians came up with rules explaining the derivation of words, their etymology, and their meaning. 
In Vedic times grammar was mapped onto the body of humans or animals: tense was mapped onto feet, 
inflections onto hands; unaspirated voiced stops belonged to the goat, nasal labials to the crane; nasal dentals, 
the elephant (Rath 2004:42). 

Grammarians were descriptivist in their linguistic practice; however, their work was used prescriptively 
to ensure the precision of the oral tradition. If sacred mantras were mispronounced, not only would the 
purity of the oral tradition be tainted, but the vibrational power of these sacred words would be destroyed. 
Mispronunciation could not only spoil a costly ritual but also cost a priest his position or a Brahman his 
caste, or even lead to death. Through their recitation Brahmans believed not only that they gained personal merit, 
but also that they contributed to the sacred order of everything. Sacred texts played a role in the “creation and 
correct maintaining of the universe, ... they were to safeguard the continuity and stability of natural laws 
and the possibility of attaining Brahman and liberation, ... even the syllables of the Vedas acted as a 
foundation for ordinary natural phenomena” (Isayeva 1997:70). The Yajur Veda was “a Scripture whose 
hymns ... and rules for correct performance were said to control the harmonious functioning of the 
universe” and the Atharva Veda consisted of “prayers for the healing of disease and the restoration of 
harmony in the world” (Chidvilasananda 1999:162, 167). 


1. PANINI. Most scholars date Panini between 500 and 400 BC (Ray 2004:24-6, Banerji 1996:122). Little is 
known of his life: he is thought to have come from north-west India, modern day Pakistan. He created one 
of the earliest Sanskrit grammars, although he mentions earlier grammarians in his work (Ray 2004:4, 8; 
Rath 2004:45). His famous grammatical treatise, called the Ashtadhyayi by Mimamsaka, including nearly 
four thousand sutras ‘short verses,’ gives a standard analytical description of Sanskrit (Nene 2004:64). 
Panini’s grammar was a highly systematised, sequentially specific series of rules that claimed to describe 
Sanskrit morphology completely, without redundancy, using a technical language relating to morphology, 
syntax and lexis, and including concepts currently known as phonemes, morphemes, and roots (Banerji 
1996:148). His treatise was divided into eight chapters, each further divided into four padas ‘sections’. In the 
first chapter Panini introduced the samjna ‘technical features’ of his grammar. In chapters four and five he 
talked about suffixes and roots, and in his later chapters he discussed the way sandhi ‘a continuous recitation 
of words in sentences’ affected articulation. The central and most complex part of Panini’s work used 
material from existing lexical lists and described their derivational rules either explicitly or through 
deduction. Panini’s basic premise was that grammar is the relation between sabda ‘linguistic forms’ and 
their artha ‘meanings’, which is fixed, permanent and nitya ‘eternal’ (Iwasaki 2004:96). 

Although Panini gave the world an exhaustive Sanskrit grammar including a wealth of phonological 
and morphological detail, the purpose of such early grammars was not intellectual knowledge for the sake 
of knowledge. Well aware of varied temporal, geographical and social dialects (each of the Vedas was to be 
used by a different social class, and the hymns within them recited in different ways by each [some silently, 
some loudly]), Panini wrote his grammar as a prescriptive support for Vedic study, performance and 
interpretation; more specifically, to establish a prescriptive standard of correct usage (Rath 2004:49-50). This 
had little to do with cultivating a golden mean for social prestige, but everything to do with the ancients’ 
belief that the words of the Vedas were revealed scriptures from God, whose performance must maintain 
the harmonious running of the universe and everything within it. 


2. KATYAYANI. Katyayani lived in southern India between 300 and 250 BC (Iwasaki 2004:98). In the over 
four thousand verses of his Varttikas, Katyayani was the first commentator on Panini (Ray 2004:26-30). 
He declared some parts of certain sutras redundant and added sutras that he felt Panini had left uncovered 
(Kulkarni 2004:119-120). He examined the inadequacies and omissions in Panini’s work in terms of the 
language of his time in order to update Panini’s work (Iwasaki 2004:98). Some speculate that Katyayani and 
the later Patanjali considered Panini’s grammar a sruti ‘revealed scripture’ in itself. It was considered the 
authority on the laws of correct word formation, and they saw their role to make whatever modifications 
were necessary in order to keep it current (Kulkarni 2004:118). By Katyayani’s time, some words had become 
obsolete, others carried different meanings; some derivations had changed. Some new sounds did not exist 
in the Vedas, but probably developed from Dravidian influence. Katyayani was the first to realize that the 
essential part of the vakya ‘sentence’ was the verb, which Panini had not defined (Kulkarni 2004:123-5). 

Contrary to Panini’s primarily scientific descriptive perspective, Katyayana’s work was more 
philosophical (Iwasaki 2004:99, Kulkarni 2004:121). Indian grammarians attempted to link purely technical 
rules with notions pertaining to the structure and origin of the universe, world, and nature (Isayeva 
1997:70). In India language became not only an instrument for the expression of thought, but also a means 
for knowing the highest reality and experiencing or merging with it (ibid). 

In orthodox Indian tradition there were two theories of causality, each held by a different school: 1) an 
effect pre-exists in its cause in a latent state, so creation presented a gradual unfolding of manifestation: the 
evolution of prakrti ‘nature’, or 2) an effect was born suddenly, as something absolutely new, different from 
its cause. Like Panini, Katyayana agreed with the former perspective: things could not be destroyed, nor new 
things come into being (Kulkarni 2004:121). He saw the word as eternal, permanent (Iwasaki 2004:96). He 
said the relationship between a word and its meaning was constant, and the study of grammar should 
proceed from that (Kulkarni 2004:122). However, he noted that word use changes over time, which may 
have been of concern to Vedic practitioners who sought to uphold the precision of their rituals and mantras 
(Rath 2004:49); thus the prescriptive purpose of grammar was to provide a standard of right usage so the 
relationship between meaning and form would not be corrupted. A grammar was to uphold the most 
“meritorious” usage, and “restrict” others (Kulkarni 2004:124). 


3. PATANJALI. Scholars date Patanjali between 200 and 150 BC (Ray 2004:32). Patanjali’s encyclopaedic 
Mahabhasya described the geography, cities, literature, poetry, and theatre of his time as well as grammar 
(Rath 2004:56). Ray (2004:33) considers it a storehouse of ancient India concerning its politics, economy, 
literature, philosophy, and scientific matters. It offered both a database of cultural data and new methods of 
interpretation. Some call the Mahabhasya the great commentary on Panini’s Ashtadhyayi; others point out 
that it spent more time paraphrasing and commenting on Katyayani’s Varttikas, approving of some sutras, 
rejecting others, supplementing more (Ray 2004:31, Iwasaki 2004:98). Patanjali’s work was divided into 
eight books, each divided into four padas (following Panini’s grammar), each with several ahnikas 
‘sections’ (Iwasaki 2004:100). These daily lessons may have been used in the classroom. Patanjali composed 
his treatise in the form of dialogues between teacher and student. Using colloquial language mixed with 
humour, he created a dialogue in which the student brought his questions and doubts to the teacher, the 
teacher explored relevant arguments, and then came to a conclusion that may or may not have been 
conclusive (Iwasaki 2004:100). Patanjali was interested in the why, not just the what (Kulkarni 2004:139). 
In the process of commenting on Panini’s grammar, he immortalized Panini’s oral text (Bhate 2004:132). 

Like Katyayani’s work, Patanjali’s Mahabhasya discusses subjects on the threshold of grammar and 
philosophy. In fact, Patanjali considered grammar instruction bereft of its philosophical foundation fit only for the 
‘dull witted’ (Bhate 2004:142). Both Katyayana and Patanjali claimed that while one of the purposes of 
grammar was the religious merit obtained through the knowledge and the correct use of speech (Bhate 
2004:136), a knowledge of grammar was also the shortest way to attain not only the purification of speech, 
but also the ultimate liberation from samsara ‘cycle of birth and death’ and the transcendental union of the 
human Atman ‘Self/soul’ with Brahman (Isayeva 1997:71-2, Ray 2004:39-40, Bhate 2004:132). Patanjali saw 
himself as a sabda pramanaka ‘upholder of the authority of the word’ (Rath 2004:44). He transcended the 
boundaries of grammar by exalting his students to the realm of metaphysics (Bhate 2004:142). 


4. GAUDAPADA. Later grammarians Bhartrhari and Abhinavagupta were influenced by the philosopher 
Gaudapada, who lived between the fifth and sixth century CE and who combined notions from early 
Vedanta and Buddhism. He was the first Vedantin to suggest an opposition between the highest reality of 
Brahman, devoid of attributes, and the multidimensional, relative, limited reality of humans, which he called 
maya (later defined as an ‘illusory dream’ by Advaitins) (Isayeva 1997:4). 

Commenting on the Mandukya Upanishad in his Mandukya karika, Gaudapada talked of the fourfold 
structure of consciousness, the four aspects of the macrocosmic soul, and the four evolutionary steps 
leading microcosmic human consciousness to liberation from all limitation, all symbolized by the four 
phonetic elements of the sacred syllable /aum/ OM. In the ten books including more than a thousand hymns 
of the Rg Veda (dated about 1200 BC, though composed over several hundred years by sages associated with 
different families), the aksara ‘sacred mantra’ OM meant that which does not perish, the eternal (Ray 
2004:13). Aksara also referred to ‘syllable’: the smallest division of the word, that to which a word can be 
reduced and from which it emerges. 

Gaudapada’s first state of consciousness was the waking state, where human consciousness perceived 
external objects through its senses and powers of action. The second state was the dream state, where internal 
objects were produced by consciousness in the form of mental images, memories and dreams. Here 
“consciousness shines forth since its external envelope is torn away and the thin sheath of dream state is 
incapable of blocking out the inner light which shines forth from the depths of the Atman or Self” (Isayeva 
1997:21). The dream state used the same parts of the body and mind as the waking state, which like 
computer hardware was unconscious but held potentialities (ibid). In the third state, translated as deep 
sleep, the Atman ‘Self’ was in a conscious state where everything became indistinct. In this state it was as if 
the doors of perception were shut and consciousness had turned within to enjoy its own nature: the Atman 
was liberated from illusions and allowed to experience the bliss of the inner light of the Self (Isayeva 
1997:22). In the fourth state, the Atman or Self had merged with Brahman, and could no longer be grasped, 
defined, thought, or described. In this fourth state there was no distinction, no opposition between external 
and internal, light and dark, joy and anguish. The fourth state was the witness of everything and could only 
be experienced in meditation. These four states fit into each other like Russian wooden dolls, writes Isayeva 
(1997:27), where each higher state completely absorbed and incorporated all the ones below, or inside, it. The 
fourth state supported everything else (ibid). 

Concerning the perennial issue of causality, while Panini and Katyayani believed the effect pre-existed 
in the cause in a latent state and others believed that effects were born suddenly, new and different from 
their causes, Gaudapada advocated a third position: that there was no creation at all, the world was illusion, 
and our perceptions were akin to dreams and illusions: they serve as empty signs for absent objects (Isayeva 
1997:37). For him, things with no beginning and no ending could not exist in the present, either. Later 
Advaitins believed that only things that never changed could be regarded as truly existing. This meant that 
all external objects of perception were unreal and false. The changing, fluid, diversifying of existence meant 
that the universe was illusory: all could be reduced to internal perception (Isayeva 1997:42). In the 
Upanishads, the only unchanging truth was the eternal Brahman; the world of external beings and objects 
was nothing but the projection of a sole and undivided consciousness Atman (Isayeva 1997:45). 


5. BHARTRHARI. As priest and grammarian, Bhartrhari transformed grammar into theology (650 CE) 
(Iwasaki 2004:103, Isayeva 1997:73). Like Gaudapada, Bhartrhari believed Brahman to be not only the 
unchanging, eternal, highest Brahman nirguna ‘without/beyond attributes’ of orthodox Indian tradition, but 
also the dynamic, active, potent form, saguna ‘with attributes or powers’, which resembled a tightly coiled 
spring ready to unfold and release the entire universe (Isayeva 1997:5). Brahman was both indivisible and 
divisible, one and many, uniform and infinitely divided (Isayeva 1997:84). Bhartrhari avoided constructing 
Gaudapada’s cosmological and psychological hierarchies: he preferred to think of the Absolute in terms of 
energy (Isayeva 1997:6). Bhartrhari considered the essence of the Absolute to be sound; speech, a condensed 
form of Brahman; the sacred syllable OM, the bija ‘seed’ of the universe, the primordial mantra to which all 
forms of speech could be reduced (Isayeva 1997:17). In the early Rg Veda vak was considered either the 
primordial principle, the supreme power, equal to Brahman, or the supreme Goddess, the active, feminine, 
creative force that brings forth the world (later, Sakti) (Rath 2004:45, Padoux 1990:5, 7, 10). Following late 
Upanishad practice, when Bhartrhari spoke of words and speech, he referred to transcendental/divine, not 
relative, speech which existed before and beyond manifested words. He saw language as the foundation of 
perception in the form of latent, unmanifested, primordial words which form the seeds of thought and 
consciousness. To him the word was silently present as potential in the beginning, and slowly transformed 
itself into an actual, articulated word that could be pronounced and heard (Isayeva 1997:85). Language was 
like an envelope to organize non-verbal perceptions into a whole. In his Vakyapadiya he wrote “This world is 
an evolution of the word” (VP1 120). Manifestation was an outcome of the pulsating seeds of vibrating 
consciousness being enlivened and coalesced into thoughts and objects. To Bhartrhari, “Brahman 
maintained the inner equilibrium of the Vedas, and language sustained the whole universe” (Isayeva 
1997:84). 

In the first book of his grammar Bhartrhari dealt with metaphysical and epistemological issues (Iwasaki 
2004:102). He introduced the notion of sphota. As someone looking at a painting first responds to it as a 
whole, not a collection of lines and colour, he saw sphota as a bursting forth of meaning, grasped suddenly, 
immediately, in a single act without any intermediary stages in the process of perception (Isayeva 1997:95). 
Bhartrhari believed that the meaning of a sentence was not an outcome of the cumulative sequence of 
individual word meanings, but rather an intuitive flash that revealed the uniform whole, shining forth once 
complete. As a consequence, in books two and three of his grammar, he declared vakya ‘the sentence’ the 
genuine unit of expression with its meaning bursting forth from the speaker’s pratibha ‘intuition’ (Iwasaki 
2004:103, Isayeva 1997:73). Bhartrhari saw sphota operating at all levels of language, from the most 
empirically concrete to the most subtle (Rath 2004:47). The words Bhartrhari used meta-linguistically in his 
grammar denote profound metaphysical meanings simultaneously: sadhanvaka referred to ‘grammatical 
meaning’ or ‘spiritual journey’, sankhya denoted ‘grammatical number’ while naming ‘a philosophical 
school’, purusa referred to ‘grammatical person’ and ‘the highest Self’. 

Following from Gaudapada’s fourfold states of consciousness and their ongoing evolution, Bhartrhari 
considered vak ‘speech’ a multilayered phenomenon (Rath 2004:46). He distinguished four types of speech: 
vaikhari ‘manifested, articulated speech’ (ordinary speech) was associated with Gaudapada’s waking stage 
of consciousness. “Having assumed the form of the syllables, when the air strikes the points of articulation 
as it goes upward, vaikhari ‘manifested speech’ is related to the breathing activity of those who speak” (Isayeva 
1997:90). Vaikhari speech was considered a reflection of some preceding activity: before the speaker talks, he 
thinks of appropriate words and meanings. This mental pre-utterance level Bhartrhari called madhyama vak. 
This inner mental speech was composed of dhvani ‘ideal, primordial ideas of sounds’ which were organized 
in terms of the krama ‘sequence’, time and space, of vaikhari speech. “It is the speech that has not yet 
transformed itself into multitudes of articulated sounds and the corresponding multitude of objects, pasyanti 
vak.” Although Bhartrhari used the idea of sphota at vaikhari and madhyama levels of speech, he saw sphota 
based on intuition as truly belonging to pasyanti vak (Rath 2004:46). Pasyanti vak was not thought or 
remembered, but intuited, revealed, heard from a subtle plane within, as the ancient rsis ‘seers’ had 
heard/seen the Vedic srutis in meditation. Saivites consider pasyanti vak as a potency about to be realized. 
The fourth type of speech, para vak ‘Supreme speech’, was the foundation of all the other forms of speech 
as well as all the existing universe. It was considered an indivisible unity, the creative matrix from which 
everything emanated (symbolized as OM, which means something resounding, throbbing, echoing in 
resonance). 

This is where twentieth century linguists have inadvertently led us astray, focusing on what Rath 
(2004:46) calls the vibrationally lowest, grossest, vaikhari level speech with no mention of the other levels 
that made vaikhari level speech possible. “With their interest in the implications of sentence holism which 
they explore in referential and semantic theory, some linguists have misinterpreted a metaphysical truth as 
a psychological one” (Rath 2004:48). The goal of Indian grammarian philosophy was not just jnana 
‘intellectual knowledge’, but vijnana ‘knowledge from authentic experience’ (Isayeva 1997:72). Bhartrhari 
believed that one had to study grammar to experience Brahman and attain enlightenment. 


6. ABHINAVAGUPTA. Bhartrhari’s philosophico-linguistic approach was much discussed in the work 
of the late eleventh century philosopher Abhinavagupta (Padoux 1990:3, Isayeva 1997). Abhinavagupta 
expounded the esoteric inner teachings of the Trika-Kaula tradition of Hindu Tantric Saivism (Foreword 
by Muller-Ortega in Singh 1988:xi, Raghavan 1981:40). His purpose in writing his Paratrisika Vivarana was 
jivanmukti ‘liberation while still alive’ (ibid). The subject matter of this text was the nature of Divine 
Consciousness (AHAM, I-consciousness), and the nature of the heart which conceals the innermost core
of reality from which the entire cosmos emanates, pulsating into embodiment. This gradual unfolding, this 
process of emanation, Abhinavagupta described in thirty-six tattvas or principles that structure reality (a 
development of the earlier 28 tattvas). The highest tattva represented the undifferentiated Absolute, and 
each tattva that followed represented the next stage of differentiation, leading from potential and 
intentionality to actual manifestation and embodiment with all the senses and powers of action that define 
and limit our notion of ourselves and the relative world (cf. Raghavan 1981:38-50). 

Each tattva was symbolized by a particular Sanskrit vowel or consonant in what Padoux (1990:223-329) 
called Abhinavagupta’s phonemic emanation theory. Following Gaudapada and Bhartrhari among others, 
Abhinavagupta described the creative evolution both of the universe and human consciousness through the 
gradual emergence of fifty varnas ‘phonemes’ representing the matrka sakti ‘mother energies’ of creation.
Each phoneme represented the phonic aspect of universal energy which corresponded to a specific level of 
consciousness and a specific stage of cosmic process (Padoux 1990:50). The sixteen vowels of Sanskrit 
symbolized the different powers/Saktis of the Lord/Siva (although Sakti was not considered different from 
Siva); and each consonant corresponded to another tattva or principle (each tattva contains within it all 
other tattvas) (Muller-Ortega 1989:62-3). 

The way Abhinavagupta developed Bhartrhari’s notion of para vak ‘Supreme Speech’ became one of
the most important theoretical tenets of the Pratyabhijna school of Kashmir Saivism. For grammarians,
language was the distinguishing feature of human consciousness which at its deepest level was identical with
being. In Kashmir Saivism theories of language were considered theories of cit ‘consciousness’ (Isayeva
1997:229). Abhinavagupta considered para vak Absolute consciousness: eternal, undifferentiated,
indestructible, creative and completely free. Para vak was the divine creative energy: on the macrocosmic
plane, it brought about the phonematic emanation up to /a, the manifestation of the universe (Singh
1988:174, Padoux 1990:6). On the microcosmic plane, it referred to that energy which enlivens the human
body. The vibratory energy of the Word was the same energy that brought worlds into existence, maintained
them, and subsequently reabsorbed them (Padoux 1990:x-xiii). The evolution of human consciousness was
the microcosmic reflection of the macrocosmic embodiment of Divine consciousness into the relative world.
The evolution of the primal sound-vibration in the cosmos and the movement of the kundalini ‘spiritual
energy’ in the body were considered reflections of one another.

As he mapped both the evolution of the universe within divine consciousness and the emergence of 
language (explicit thought) within human consciousness through the four levels of speech, Abhinavagupta 
also mapped the yogi’s return journey from the diversity and phonic variety of the relative world to the 
unity of the silent source place deep within the heart. The outward movement of evolution was reversed in 
the yogi’s inward movement, or involution, just as the energy of speech moved through stages that 
correspond to cosmic creation and resorption. With the cosmic flow of emanation, the highest energy of 
para vak lost its freedom until it was contracted in the limited word of human language subject to the rules 
and limitations that made it a source of bondage. However, if practitioners purified/raised the vibration of 
their vaikhari level language by chanting divine mantras revealed at pasyanti vak, they could experience 
Supreme consciousness and end the cycle of worldly existence. In other words, while vaikhari vak bound, para
and pasyanti vak liberated (Padoux 1990:xii).


7. SUMMARY. At this point it is not my purpose to debate what some call metaphysical matters. In 
this paper my goal was to reconsider and re-contextualize the work of ancient grammarians in terms of their 
philosophical, scientific and cultural beliefs in order to better appreciate the magnitude of their achievement. 
Twentieth century scholars like Saussure, Firth, Jakobson and Bloomfield knew of Panini’s grammar and 
valued it highly. Distinctions like langue and parole clearly show evidence of the work of early Indian
grammarians, which is not surprising, given that Saussure taught Sanskrit for over thirty years, and was 
clearly aware of early Indian scholarship. However, the more I learn not only of Panini, but also of 
Katyayana, Patanjali, Bhartrhari, Abhinavagupta and other ancient Indian grammarians, the more I am 
astounded by the breadth and depth of their linguistic concerns. 

To them, language was not just about morphological roots and stems or cultivating a precise usage 
amidst dialectal variation in order to preserve the purity of their oral scriptures. Language also had a role to 
play in the creation of the universe and the evolution of human consciousness as mapped out in its four 
stages or levels of speech: para vak, pasyanti vak, madhyama vak, and vaikhari vak. As twenty-first century
grammarians, we can no longer afford to focus on ordinary speech without recognizing that vaikhari level
speech originated in madhyama, which came from pasyanti, which originated in para vak, the highest level
of speech, which included all sound as potential in perfect silence: completely empty, completely full. The 
ancient grammarians were not satisfied by a limited view of language as a formal system of rules: they did 
not separate it from its social, psychological and philosophical context. Nor should we. 


REFERENCES 


BANERJI, S. 1996. Historical survey of ancient grammars. Delhi: Sharada Publishing. 

BHATE, S. 2004. Patanjali, the Bhasyakara. In Panini to Patanjali: A grammatical march, ed. by B. Ray, 
128-42. 

BLOOMFIELD, L. 1933. Language. Chicago: University of Chicago Press. 

CHIDVILASANANDA, G. 1999. Courage and contentment. A Siddha Yoga Publication.

FLOOD, G. 1996. A history of Hinduism. Cambridge: CUP.

ISAYEVA, N. 1997. From early Vedanta to Kashmir Shaivism. Delhi: Sri Satguru Publications.

IWASAKI, Y. 2004. Development of Sanskrit grammar after Panini. In Panini to Patanjali: A grammatical 
march, ed. by B. Ray, 95-114. 

KULKARNI, M. 2004. Contributions of Katyayana to the Paninian grammar. In Panini to Patanjali: A 
grammatical march, ed. by B. Ray, 115-127. 

MULLER-ORTEGA, P. 1989. The triadic heart of Shiva. New York: State University Press.

NENE, S. 2004. Panini’s contributions to Sanskrit grammar. In Panini to Patanjali: A grammatical march, ed. by B.
Ray, 62-79.

PADOUX, A. 1990. Vac. Delhi: Sri Satguru Publications. 

RAGHAVAN, V. 1981. Abhinavagupta and his works (Chap. III Paryantapancasika). Delhi: Chaukhambha
Orientalia. 

RATH, K. 2004. Genesis of Sanskrit grammar. In Panini to Patanjali: A grammatical march, ed. by B. Ray, 
41-61. New Delhi: D. K. Printworld Pvt. Ltd.


RAY, B. 2004. Panini to Patanjali: A grammatical march. New Delhi: DK Printworld Ltd. 


SINGH, J. 1988. A trident of wisdom (transl. of Abhinavagupta’s Paratrisika-Vivarana), ed. by B. Baumer. New
York: State University Press. 


This article was first published at lacus.weebly.com.


LACUS Forum 41:3.43-55 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.
Copyright © 2020 the Linguistic Association of Canada and the United States. 


SOME LINGUISTIC PERSPECTIVES ON SUPERSTORM SANDY: 
TWO GOVERNORS AND THEIR COMMUNITIES 


JACQUELINE NENCHIN SUSAN FELTMAN 
Molloy College Westchester Community College 


Abstract: This research examines certain linguistic aspects of the speeches of Governors Christie (NJ) and 
Cuomo (NY), during the media coverage of the natural catastrophe Superstorm Sandy in their communities, 
from phonological and lexico-grammatical perspectives. The purpose of this investigation is to add to the 
growing body of research on language in the media and social life. A two-pronged investigation involves (a) 
evaluating focus groups’ perceptions of Governor Cuomo’s use of the New York City dialect and Governor
Christie's use of the North New Jersey dialect and (b) analyzing some lexico-grammatical aspects of the
governors’ speeches from a Systemic Functional Linguistics approach. Results from the interpretation of the
data on the Superstorm Sandy speeches indicate that each governor employs unique lexico-grammatical
choices expressed orally in different dialects of English, thereby creating different meanings, and that one 
governor is perceived as a more effective speaker and a stronger leader. 


Key Words: Language in politics, language and power, New York City dialect, North New Jersey dialect, 
Systemic Functional Linguistics 


LANGUAGE IS AT WORK in society in good times and in bad. On October 22, 2012, a tropical wave in 
the Caribbean quickly developed into Tropical Storm Sandy. In two days, it was upgraded to hurricane 
status; thus, it became Hurricane Sandy, and soon thereafter it became apparent that it would hit the Eastern 
Seaboard of the United States. This news triggered a flurry of media coverage and renamings of the storm, 
from Snow-icane Sandy (Amira 2012) and Snor-eastercane Sandy (Doll 2012) to Frankenstorm Sandy 
(CBS/AP 2012) due to its proximity to Halloween, and finally to Superstorm Sandy (CNN 2012), the name 
by which we refer to it, because when it reached New York and New Jersey, the linguistic region under 
study, it had been downgraded to a post-tropical cyclone. Though its status had been downgraded, a 
confluence of factors, such as a full moon and three high tides, turned it into the most devastating storm to 
hit the New York area in two centuries, with 53 deaths in New York State and 34 in New Jersey, most due to 
drowning (Centers for Disease Control, 2013), and billions in property and infrastructure losses. The media 
coverage associated with the storm included many press briefings by Governors Christie and Cuomo; these 
briefings figured significantly in the preparedness before and during the storm and in the consequences in 
the wake of the storm. 

We became interested in the press briefings by New Jersey Governor Christie, a speaker of North Jersey 
English, and New York Governor Cuomo, a speaker of New York City English, and in the ways they 
conveyed essential information and instructions to their constituents during the crisis. Both governors are 
seen as potential candidates for the US presidency, and Governor Christie in particular has been active on
the national political stage, so their interactions with their constituents may have an even wider significance 
in the future than during the natural disaster in question. The way citizens view their leaders is a complex 
issue, and leaders and their ways of communicating are also complex, so these speeches have the potential 
of revealing many different aspects of the governors’ public linguistic performance. 

Press briefings differ from other types of speeches made by politicians, such as state of the state 
addresses, budget addresses, or other public addresses. They are not planned for an extended period 
beforehand, they are not ghostwritten, and they are at least partly adlibbed. The drama in press briefings is 
less rehearsed than real, because briefings generally respond to urgent issues of great concern. The press 
briefings by the governors in response to Superstorm Sandy are both urgent and spur-of-the-moment, and
are, therefore, a good gauge of the governors’ unrehearsed language use. In this regard, the conditions 
mentioned in the text samples resemble the samples in a study by Labov & Waletzky (1967:5), in which 
participants were asked to relate a story about a frightening event (“Were you ever in a situation where you 
were in serious danger of being killed?”) in order to obtain a speech sample with minimal possible 
monitoring. 

The context of the speeches and the texts are integrally linked. Halliday (2002:15) argues that language 
can be studied through the context or the text: “Text and situation come into being together; so whatever 
kind of order we set up between them, it must be such that we can start from either end.” In her study of 
news discourse, Lukin clarifies: 


Hasan’s idea that “there is a continuity from the living of life on the one hand right down to the 
morpheme on the other” (Cloran et al. 1996: 1). The implication here is each and every morpheme 
is recruited to the business of construing the context - which in this case is one where meaning is 
highly contested. (Lukin 2013:2) 


The press briefings by Governors Christie and Cuomo have basically the same context. 

During Superstorm Sandy, the governors react to the same situation, but a study of their use of dialect at
the phonological level and language at the morpheme level reveals different meanings that are perceived
differently by their constituents. Other issues, such as the physical demeanor of the governors and the higher
frequency of Christie media coverage over Cuomo media coverage, resulting in more information coming
to New Yorkers from New Jersey than from New York, are not within the scope of this study, though
they have significance for the overall situation; for example, even a Google search for news video clips (“Chris
Christie and Sandy” and “Andrew Cuomo and Sandy”) elicits 2,290,000 hits for Christie, but only 157,000
for Cuomo.

This paper investigates the governors’ language use through a two-pronged analysis involving (a) an
evaluation of focus groups’ perceptions of Governor Cuomo’s use of the New York City dialect and 
Governor Christie’s use of the North New Jersey dialect with regard to variation (Labov 1966) and (b) a 
lexico-grammatical analysis from a Systemic Functional Linguistics approach (Halliday & Matthiessen 
2014). The purpose is to discover 


(a) if dialect use by the two governors affects the way that they are perceived by listeners and 

(b) how the governors’ choices of mood and modality differ from one another and affect their
meaning.

The findings were combined to provide insight into the ways that the two leaders
convey meaning in their interactions with their constituents, insight that can be applied to a
better understanding of how language is used in politics and social life.


1. BACKGROUND. The theoretical underpinnings center on the social and contextual orientation to
language use (Labov 1966; Halliday 2009; Hasan 1996) by key players during the societal crisis initiated by
Superstorm Sandy in 2012. This study involves listeners’ perceptions of dialects and speakers’ uses of 
lexicogrammar. Halliday (2009) defines a dialect as “a variety of language distinguished according to the user: 
different groups of people within the language community speak different dialects” (16). A person normally 
speaks only one dialect, which “represents the total range of patterns used by his section of the community” 
(22). Speakers of a dialect share most phonetic and phonological aspects of the dialect, though the normal 
assumption is that an individual differs in performance from other members of his language community 
“through idiosyncratic phonetic habits” (25). However, Halliday argues that a person’s idiolect can also be 
distinguished in written texts by lexicogrammar. 

Not all dialects of English are viewed in the same way by other dialect communities. Halliday (2009:31) 
contends that “there is actually no such thing as a slovenly dialect or accent” or a debased dialect. That 
language develops along different trajectories in different communities does not indicate that either one or 
the other is higher or more fully developed. Halliday remarks that urban dialects in Britain are sometimes 
labeled as harsh or guttural and are, therefore, viewed as unpleasant and lacking prestige. He explains that 
such attitudes are learned. This is the case with New York City English, which is the focus of deep-rooted 
negative attitudes among members of the American English-speaking community (Labov 1966; Labov et 
al. 1986). 

Halliday (2009:199) maintains that “social interaction takes a linguistic form, which we call text. A text 
is the product of indefinitely many simultaneous and successive choices in meaning and is realized in 
grammatical structure.” Speakers or writers make selections from all the options of lexicogrammar to create 
meanings that they wish to convey. All of the selections that are available make up the meaning potential of
that language. Human interaction and social processes are dynamically and reciprocally linked with 
language as it is used by members of the language community and by the lexicogrammatical choices that 
they make as they communicate with each other. 


2. METHOD. The two questions guiding this research are, “Does dialect use by the two governors affect 
the way that they are perceived by listeners?” and “How do the governors’ choices of mood and modality 
(a) differ from one another and (b) affect their meaning?” The data comprise speeches by the governors
before, during, and after Hurricane Sandy. These texts were unrehearsed and represent narratives of and 
responses to events taking place during a time of life-threatening crisis (cf. Labov & Waletzky, 1967). 
Additionally, focus groups from New York and New Jersey were used to determine the participants’ 
perceptions of the speakers through listening to select speeches (Labov et al. 1986). A two-minute excerpt
from a press briefing by each of the governors was played to the focus groups, who then answered a 
questionnaire about the recordings. 

To answer the first question, which sought to determine the effect of the dialects on listeners, we looked 
at a few aspects of the governors’ use of sounds. Because the two governors are well-known speakers of these 
common dialects in the NY metropolitan area, Governor Christie of North Jersey English and Governor 
Cuomo of New York City English, we did not analyze their speech per se, but singled out a few salient 
phonological markers of the governors’ dialects for note. These included the post-vocalic r, th fortition, and 
the split a. The two dialects have many similarities but differ in these features (see Table 2 for examples 
from the speeches). 

As it is not possible to poll all New Jerseyans and New Yorkers, which would give us a complete picture 
of attitudes towards the dialects in question (Labov et al. 1986), we assembled two focus groups, one from 
North Jersey and one from Long Island, areas where the dialects are prevalent. People aged 21 and older 
were asked to volunteer for the focus groups. Twenty-six people volunteered, 20 from New York and 6 from 
New Jersey. The age range was from 21-83, with the oldest participants (70+) in the New York group. The
under-30 participants included four New Yorkers and two New Jerseyans. A broad range of occupations
was represented, from student to retiree and from grocery store manager to software designer and lawyer. 
Most volunteers had some postsecondary education, and some had graduate degrees. All participants were 
naive as to the purpose of the study and (initially) to the identities of the speakers. 

A focus group questionnaire was designed to explore the participants’ perceptions of the governors’ 
speeches (see Appendices A and B). The questionnaire asked each person to identify his/her age, gender, 
and occupation. After that, the questionnaire was divided into three sections. The first section was to be 
filled out after the first excerpt was played; the second was to be filled out about the second excerpt, and the 
third section asked the participants to compare the two speakers. Each focus group was instructed to focus 
on the speeches and not on any feelings that they might have if they recognized the speakers. A two-minute 
segment by each governor was played. Participants listened to each one, after which they answered the 
survey questions, which did not attend to phonological variables since reactions to them are “below the level 
of conscious awareness ... [and] there is no vocabulary of socially meaningful terms with which our
informants can evaluate speech for us” (Labov, 1967:405-406). 

The data from the questionnaires were collected and tallied. The results were organized into tables 
based on age, gender, and state, after which the data were summarized. 
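
The tallying step just described could be carried out along the following lines. This is only an illustrative sketch: the records, the field names (state, age, gender, prefers), and the age bands are invented for the example and are not the study's data or instrument.

```python
from collections import Counter, defaultdict

# Hypothetical records, invented for illustration only (not the study's data).
RESPONSES = [
    {"state": "NY", "age": 24, "gender": "F", "prefers": "Christie"},
    {"state": "NY", "age": 52, "gender": "F", "prefers": "Cuomo"},
    {"state": "NY", "age": 67, "gender": "M", "prefers": "Cuomo"},
    {"state": "NJ", "age": 28, "gender": "M", "prefers": "Christie"},
    {"state": "NJ", "age": 41, "gender": "F", "prefers": "Cuomo"},
]


def age_band(age):
    """Assumed age bands; the paper itself only names groups such as 21-29 and over 45."""
    if age < 30:
        return "21-29"
    if age <= 45:
        return "30-45"
    return "over 45"


def tally(records, group_key):
    """Count answers to the (hypothetical) 'prefers' item within each group."""
    table = defaultdict(Counter)
    for record in records:
        table[group_key(record)][record["prefers"]] += 1
    return table


# One table per grouping variable: state, age band, gender.
by_state = tally(RESPONSES, lambda r: r["state"])
by_age = tally(RESPONSES, lambda r: age_band(r["age"]))
by_gender = tally(RESPONSES, lambda r: r["gender"])

for label, table in (("state", by_state), ("age", by_age), ("gender", by_gender)):
    for group in sorted(table):
        print(label, group, dict(table[group]))
```

Keeping one counter per group makes it straightforward to turn the raw counts into the percentages reported for each group.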

The second research question concerning the governors’ choices of mood and modality was 
investigated using specific observer perspectives offered within the Systemic Functional Linguistics (SFL) 
framework, in particular the interpersonal metafunction. This perspective was used as a tool to discover 
how the governors communicated with their constituents and the media through the use of mood and 
modality. Butt et al. (2012:111) explain that “for interpersonal meanings the crucial relationship is between 
grammatical functions. The two grammatical features that carry the burden of the meaning are the Subject 
and the Finite. They combine to make the mood of the clause.” Studying the interpersonal metafunction 
reveals which mood was the most common (declarative, imperative, or interrogative) and which modal 
verbs and other forms of modality, such as modal adjuncts like probably and interpersonal grammatical 
metaphors like I think, were used.

For the SFL analysis, two excerpts were chosen from among each governor’s speeches. Though much
of the text was informational (propositions on the level of semantics), we chose to focus on excerpts that
were mostly procedural, since their purpose is to instruct constituents in how to do something, in this case, 
in how to survive Superstorm Sandy and its aftermath. On the semantic stratum, this type of text involves 
proposals, demanding goods and services (commands), and modality (semantically between propositions 
and proposals). An SFL interpersonal analysis identifies subject, finite, modal finite, predicator (the rest of 
the verbal group), complement, adjunct (prepositional phrases and adverbial groups that express 
probability, usuality, typicality, or obviousness), mood block (the subject and the finite), residue (the parts
of the sentence not in the mood block), and interpersonal grammatical metaphor (I think/I believe). Table
1 presents an example of interpersonal analysis. 


Table 1. Example of Interpersonal Analysis

Row labels: Constituency; Interpersonal Metafunction
Surviving cells: Finite: present; Predicator; Complement; Mood
[remaining cells not legible in the source]


The excerpts analyzed comprised 64 clauses from two press briefings by Governor Christie and 62 clauses
from two press briefings by Governor Cuomo. 


3. RESULTS. Results from the interpretation of the data on the Superstorm Sandy speeches as presented 
in press briefings indicated that each governor employed unique lexico-grammatical choices expressed 
orally in different dialects of English, thereby creating different meanings, and that one governor is 
perceived as a more effective speaker and a stronger leader. Table 2 provides an overview of three of the 
features that distinguish the governors’ dialects: 


Table 2. The Governors’ Dialects

North New Jersey Dialect: Governor Chris Christie | New York City Dialect: Governor Andrew Cuomo

Rhotic: pronunciation of the post-vocalic r. Examples: shelters, first, upper, storm surge | Non-rhotic: r-lessness. Examples: weathuh (weather), engineez (engineers), powuh (power), stawm (storm)

No th-fortition (though some North New Jersey speakers do have it) | th-fortition: dental “t” and “d” instead of th. Example: duh day (the day)

No short “a” split | Short “a” split (different pronunciations for bad/bat)





3.1. QUESTIONNAIRE RESULTS. The questionnaire results provided insight into the participants’ 
perceptions of the governors’ dialects. The results to the questions on the individual excerpts (see Appendix 
A) are presented first. Then the results to the comparison questions (see Appendix B) are shown. 

The New York focus group responded to all questions on the survey except for the first question, which 
asked them to positively identify the speaker. Almost half of them left that question blank. Some New 
Yorkers (20%) did not recognize Cuomo’s voice and mistook him for other NY politicians, such as Senator 
Charles Schumer or New York City Mayors Mike Bloomberg and Bill de Blasio. The same number did not 
recognize Christie, and 15% did not correctly identify his state. Most participants said that they found both 
Christie’s voice and Cuomo’s voice to be pleasant and confident, and they said both governors sounded 
educated. When asked if they liked listening to Christie, 27% of participants answered that they did. 

The New Jersey focus group responded to the survey on the individual excerpts (see Appendix A). New 
Jerseyans rated the two governors equal in preference. All New Jerseyans recognized both Christie's and Cuomo’s voices,
though one identified Cuomo as his father, Mario Cuomo (though she said later that she knew it was Andrew
but had forgotten his name at first), and another simply as the governor of New York. Most participants 
said that they found both governors’ voices to be pleasant, confident, and educated, though Cuomo was 
slightly more likely to be perceived as uneducated. When asked if they liked listening to the governors, 38% 
felt neutral towards Christie and 31% felt neutral towards Cuomo. Only 15% liked listening to Christie and 
27% liked listening to Cuomo. New Jerseyans were equally split between liking, disliking, and feeling neutral 
about listening to both governors, while New Yorkers preferred listening to Cuomo. It may be difficult for 
people to set aside their political opinions to view the speakers’ texts less subjectively, but they were instructed 
to do so. It may also be that the relationship between linguistic performance and political performance 
cannot be entirely separated even on the unconscious level, but every effort was made to do so. 

The responses to the Comparison Survey (see Appendix B) yielded information from both multiple 
choice and open-ended questions. In answer to the multiple-choice question, New Yorkers and New
Jerseyans gave different answers. Most New Yorkers and more female than male New Yorkers over 45 (58%) 
preferred to listen to Gov. Cuomo, whereas most young New Yorkers (21-29) preferred to listen to 
Governor Christie whether they recognized his voice or not. New Jerseyans and young New Yorkers were 
more likely to perceive Cuomo as uneducated than Christie. Overall, most found both governors’ speeches 
to be believable and effective, but New Yorkers over 45 found Cuomo’s to be more effective. The responses to 
the open-ended questions “Is there anything in particular that you like about the voice of Speaker 1/2?” and 
“Is there anything in particular that you dislike about the voice of Speaker 1/2?” elicited a variety of 
responses, both positive and negative, some of which are provided in Table 3. 


Table 3. Results from the Survey’s Open-Ended Questions

Is there anything in particular that you like or dislike about the voice of Speaker 1? (Governor Christie)

Adjectives:
Soft, understanding but affirmative (NJ)
Too monotone (NJ)
Informative (NY)
Dull, hesitant (NY)
Warm and encouraging (NY)
Nasal, whiny (NY)
Congenial (NY)
Rushing (NY)

Phrases and clauses expressing likes:
There was concern in his voice for the people of NJ. (NJ)
Speaks from heart, trust us, we will help you. (NJ)
Gave me a warmer feeling. (NJ)
Clearly spoke about details (NY)

Phrases and clauses expressing dislikes:
He said “uhhh” a lot. (NJ)
Likes to listen to [him to] make fun (NJ)
He sounded exhausted and stuffy. (NJ)
Distracting impediment (NY)

Is there anything in particular that you like or dislike about the voice of Speaker 2? (Governor Cuomo)

Adjectives:
Very nasal. Very whiny (NJ)
Soft spoken (NY)
Clear (NY)

Phrases and clauses expressing likes:
His voice was more clear and concise. (NJ)
NYC accent (NY)
Easy on ears (NY)

Phrases and clauses expressing dislikes:
I didn't get that. (NJ)
His voice is irritating. (NY)
A bit condescending: “don’t go to the beach and take photos” “be smart” (NY)
Sounds more like he is reading and not talking from the heart. At a time of catastrophe you want to hear dependability and understanding.
NYC accent (NY)





One of the participants mentioned Cuomo’s use of d instead of th (th-fortition) and expressed 
embarrassment for the speaker. Another participant insisted that Governor Christie has a speech
impediment. The comment about Governor Christie rushing may be explained, at least in part, by the fact 
that he used about 33% more words in the 2-minute excerpt than his New York counterpart. 


3.2. SFL INTERPERSONAL ANALYSIS. The results of the SFL interpersonal analysis reveal the ways in
which the two governors use modality. The analysis here is on the level of the major clause, which is the 
fundamental meaning structure for communicating with one another linguistically (Halliday & Matthiessen 
2014). Each clause consists of an event, which is expressed by a verbal process. The clauses under analysis 
here are ranked clauses, identified by the traditional terms: independent, dependent, and nonfinite clauses. 
Embedded, or rankshifted, clauses are not included. 

The interpersonal analysis deals with meanings encoding the relationship between speaker and listener. 
This is accomplished through the system of mood. The dominant mood in the speeches by both governors is 
declarative, and the second most frequent type of mood in both is the imperative (see Tables 4 and 5). 


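Table 4. Mood in Governor Christie’s Texts

Mood | Text 1 (43 clauses) | Text 2 (21 clauses) | Total (64 clauses)
Declarative | [values not legible in the source]
Imperative | 10, 10 [column placement not legible in the source]
Interrogative | [values not legible in the source]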


Table 5. Mood in Governor Cuomo’s Texts

Mood | Text 1 (21 clauses) | Text 2 (40 clauses) | Total (62 clauses)
Declarative | 17 | [not legible in the source] | [not legible in the source]
Imperative | 4 [2 | 6 and 2 | 12


Both use two negative commands with don’t, but only Governor Cuomo uses the jussive imperative (let’s)
twice. The New Jersey group is smaller than the New York group. 

The analysis of modality reveals differences in the governors’ use of modal verbs, mood adjuncts, and 
grammatical (interpersonal) metaphor. There are three values of modality, according to Halliday & 
Matthiessen (2014): high, median, and low. The modal verbs are classified according to value in Table 6. 





Table 6. Three values of modality 


High:    must, ought to, need, has/had to
Median:  will, would, should, is/was to
Low:     can, may, could, might


Can has the sense of probability, as in There can be any number of misunderstandings. Can in the sense of 


50 JACQUELINE NENCHIN AND SUSAN FELTMAN 


ability/potentiality is, according to Halliday & Matthiessen (2014:696) “on the fringe of the modality system” 
and is connected to the idea of readiness (being inclined and able to). Like can, be able to is of low value 
and falls into this same type of ability/potentiality. 

Mood adjuncts can also be grouped according to type and value (Halliday & Matthiessen 2014:188-89). 
Thus, mood adjuncts of modality include probability and usuality. In the probability group, definitely and 
certainly are of high value, probably is of median value, and maybe and possibly are of low value. There are 
also mood adjuncts of intensity, including degree and counterexpectancy types. The values of 
counterexpectancy adjuncts are categorized as exceeding (even, actually, really, in fact) and 
limiting (just, simply, merely). 
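
The value classification described above can be sketched as a simple lookup. The following is only an illustrative sketch, not the procedure the authors used: the dictionary reproduces the values given in Table 6 and in the paragraph on mood adjuncts of probability, while the function name tally_values and the sample input are assumptions introduced here.

```python
from collections import Counter

# Values as listed in Table 6 (modal verbs) and in the text (mood adjuncts of
# probability). "be able to" is added as low value, following the text.
MODALITY_VALUE = {
    "must": "high", "ought to": "high", "need": "high",
    "has to": "high", "had to": "high",
    "will": "median", "would": "median", "should": "median",
    "is to": "median", "was to": "median",
    "can": "low", "may": "low", "could": "low", "might": "low",
    "be able to": "low",
    "definitely": "high", "certainly": "high",
    "probably": "median",
    "maybe": "low", "possibly": "low",
}

def tally_values(expressions):
    """Count how many of the given modal expressions fall into each value.

    Expressions that are not in the lookup table are ignored.
    (Hypothetical helper, for illustration only.)
    """
    counts = Counter()
    for expression in expressions:
        value = MODALITY_VALUE.get(expression.strip().lower())
        if value is not None:
            counts[value] += 1
    return counts

# Hypothetical input: a hand-made list of expressions, not data from the study.
sample = ["can", "could", "be able to", "maybe", "should", "will"]
print(tally_values(sample))
```

A lookup of this kind only records the value of a form; deciding whether a given can expresses probability or ability/potentiality still requires reading the clause in context.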

The findings show that Governor Christie uses only can, could, and be able to modal verbs of 
ability/potentiality, as in the clause as quickly as we can (see Table 7). 


Table 7. Use of modality in Governor Christie's texts 


                        Text 1 [number of times in text] | Text 2 [number of times in text] | Total [number of times in text]
Modal verbs             Can | Can [2] | Can [3]
                        Be able to | Be able to
Mood adjuncts           Maybe
Grammatical metaphor    I think/guess/mean

He also uses the mood adjunct maybe, which has low value. 
In contrast, Governor Cuomo uses a wider range of modal operators, ranging from the low value can 
and could to will and should, which are of median value, as in the clause people should keep themselves 
informed (see Table 8). 





Table 8. Use of modality in Governor Cuomo’s texts 

                        Text 1 [number of times in text] | Text 2 [number of times in text] | Total [number of times in text]
Modal verbs             Should [1] | Can [4] | Should [1]
                        Will [1] | Will [4] |
Mood adjuncts           Really [2]
Grammatical metaphor

Additionally, he uses the modal adjunct really, which is a mood adjunct of intensity, not probability. It 
indicates counter-expectancy and has high value. He also uses the verb group complex need + inf seven 
times in Text 1, including in the passage below: 







Citizens have a responsibility too. They need to be smart. They need to use 
common sense. They do not need to be on the road. You really don’t. 
You need to leave the roads free. 


Need + infinitive is not a modal operator; it is an example of projection in the hypotactic (dependent) verbal 
group complex. However, Governor Cuomo uses need in these verbal group complexes as an agnate for 
should, e.g., they should be smart. 


Unlike Cuomo, Governor Christie uses the interpersonal metaphor I think/I guess, a metaphorical 
realization of probability, when, for instance, he says, I think all of our mothers taught us ... don't be stupid. 
Other examples include: 


The margin for error for me being wrong and you going and staying at 
a friend's house for two days is significantly better than winding up 
with a severe injury or death for yourself or for your family, I think. 


I guess I've used one line already. 


These instances of interpersonal metaphor align with the rest of the interpersonal profile of Christie’s text 
as dominated by probability and ability/potentiality. 


4. DISCUSSION. This investigation involved research into attitudes towards the dialects used by 
Governors Christie and Cuomo and their use of modality when communicating with their constituents 
during the crisis brought on by Superstorm Sandy. The focus of the research was to look at the press 
briefings both from the standpoint of listeners reacting to the speakers’ different dialects and from the 
standpoint of the speakers using modality, as evident through an analysis of the interpersonal metafunction. 

The New York City dialect and the North New Jersey dialect have been extensively studied, especially 
by Labov (1966) and Labov et al. (1986), so it was not the purpose of this research to further investigate any 
changes in the dialect itself. The purpose of this research was, however, to consider the role of dialect in 
social life, in particular during the societal crisis presented by Superstorm Sandy. The answer to the first 
question “Does dialect use by the two governors affect the way that they are perceived by listeners?” is that 
the governors have distinct dialects and speaking styles that differ from one another and that are perceived 
differently. Governor Cuomo is viewed by the majority of listeners as more effective on the one hand but 
as uneducated on the other. His dialect is clearly recognized as the New York City dialect, and some 
respondents, including New Yorkers, view it in a negative way, having commented on it in the open-ended 
questions, thereby supporting the continued stigmatization of the dialect. Cuomo’s speech found more 
support among older (60+) New Yorkers than any other group. The youngest respondents prefer Governor 
Christie’s speech, whether they were from New York or New Jersey, and whether they recognized his voice 
or not. 




The answer to the second question “How do the governors’ choices of mood and modality (a) differ 
from one another and (b) affect their meaning?” is that the governors’ choices of modality are distinguished 
from one another with regard to modal verbs, mood adjuncts, and interpersonal grammatical metaphor. 
These differences represent the governors’ diverse styles of interaction with their constituents. The 
procedural texts by the governors include modalized clauses, which are “in principle ambiguous as between 
proposition and proposal” (Halliday & Matthiessen 2014). Through modality, indicative clauses become 
more like proposals and are, in effect, a more delicate way of giving a command (Halliday & Matthiessen 
2014). Governor Cuomo’s press briefings include greater use of the imperative mood, including the jussive 
imperative (let’s), the use of modals of obligation (will and should) in addition to the modal of probability 
can, and verbal group complexes with the verb need plus infinitive. Governor Christie uses modals of 
ability/potentiality, mood adjuncts of probability, and grammatical metaphor (probability). These results 
counter the public view of Governor Christie to some extent. Christie is often criticized in news media and 
among people for being bossy and brash, yet his lexico-grammatical choices are, in fact, more subdued than 
Governor Cuomo’s, whose choices of modality are of higher value and oblige his listeners to follow his 
instructions. 


5. CONCLUSION. The New York City dialect continues to be stigmatized. Its use by politicians may affect 
the public’s view of them, especially outside of their region and among younger constituents. Clear diction 
and the use of modals of obligation seem to have contributed to the effectiveness of Cuomo’s speech, despite 
his NYC dialect. The research has implications regarding the usage of dialects by politicians and the role of 
mood and modality in political speech. This is a small-scale study of downstate New Yorkers and North New 
Jerseyans. A larger study on a statewide or national scale could produce different results and have important 
implications for national political races. Additionally, more research on other aspects of the governors’ 
speeches could offer further perspectives on the two governors, including a multimodal assessment of their 
press briefings. 


REFERENCES 


AMIRA, D. 2012. It’s time to start freaking out about Snow-icane Sandy. New York Magazine, October 
25. Accessed November 1, 2014. 
http://nymag.com/daily/intelligencer/2012/10/hurricane-sandy-new-york-storm-snowicane.html?mid=twitter_nymag 

BUTT, D., R. FAHEY, S. FEEZ, & S. SPINKS. 2012. Using functional grammar: An explorer’s guide (3rd 
ed.). South Yarra, Vic., Australia: Palgrave. 

CBS/AP. 2012. Hurricane Sandy may slam into East Coast as Halloween Frankenstorm. CBS News, 
October 25. Accessed November 1, 2014. 
http://www.cbsnews.com/news/hurricane-sandy-may-slam-into-us-east-coast-as-halloween-week-frankenstorm/ 

CENTERS FOR DISEASE CONTROL. 2013. Deaths associated with Hurricane Sandy, October-November 
2012. Morbidity and Mortality Weekly Report 62(20), 393-397, October 25. Accessed November 1, 2014. 
http://www.cdc.gov/mmwr/preview/mmwrhtml/mm6220a1.htm 

CNN. 2012. Superstorm Sandy’s toll: Mounting deaths, historic destruction, stranded residents. CNN 
News, October 30. Accessed November 1, 2014. 
http://news.blogs.cnn.com/2012/10/30/superstorm-sandys-wrath/ 

DOLL, J. 2012. How to feel about a snor’eastercane. The Atlantic, October 25. Accessed November 1, 
2014. http://www.thewire.com/national/2012/10/how-feel-about-snoreastercane/58352/ 

HALLIDAY, M. A. K. 2002. How do you mean? On grammar. Volume 1 in the Collected Works of M. A. K. 
Halliday, ed. by J. Webster. London: Continuum. 

---. 2009. Language and society. Vol. 10 in J. J. Webster (Ed.), Collected works of M. A. K. Halliday. 
London: Continuum. 

HALLIDAY, M. A. K., & C. M. I. M. MATTHIESSEN. 2014. Introduction to Functional Grammar (4th 
ed.). London: Routledge. 

HASAN, R. 2009. Semantic Variation: Meaning in Society and in Sociolinguistics. The Collected Works of 
Ruqaiya Hasan, Vol. 2. London: Equinox. 

---. 1996. The grammarian’s dream: Lexis as most delicate grammar. Ways of Saying, Ways of Meaning: 
Selected Papers of Ruqaiya Hasan, ed. by C. Cloran, G. Williams and D. Butt. London: Cassell. 

LABOV, W. 1966. The Social Stratification of English in New York City. Washington, D.C.: Center for Applied 
Linguistics. Second edition: Cambridge: Cambridge U. Press, 2006. 

LABOV, W., D. GRAFF, & W. A. HARRIS. 1986. Testing listeners’ reactions to phonological markers. In 
Diversity and diachrony, ed. by D. Sankoff, 45-58. Philadelphia: John Benjamins. 

LABOV, W., & J. WALETZKY. 1967. Narrative analysis: Oral versions of personal experiences. Journal of 
Life History, 11-4), 3-38. 

LUKIN, A. 2013. Journalism, ideology, and linguistics: The paradox of Chomsky’s linguistic legacy and his 
‘propaganda model.’ Journalism, 14(1), 1-15. 




Appendix A 


Survey Questions for Each Excerpt 


1. When I listen to this excerpt, I hear a familiar/unfamiliar voice. 
(Circle one of the underlined words.) 


2. Circle one of the following and fill in the blank if you can: 
a. I know who this speaker is. It is ______. 
b. I think I know who this speaker is. I think it is ______. 
c. I don’t know who the speaker is. 


3. This speaker's voice is 
a. pleasant 
b. unpleasant 
c. neutral 

4. This speaker sounds 
a. confident 
b. not confident 
c. I don’t know 


5. The speaker sounds 
a. educated 
b. uneducated 
c. I don’t know 


6. I ______ listening to this speaker 
a. like 
b. don't like 
c. feel neutral about 


7. The speaker is from 


a. New York 
b. New Jersey 
c. Another state 




Appendix B 


Survey Comparison Questions 


Now that you have listened to both excerpts, please answer the following questions. 


1. Which speaker do you prefer to listen to? 


a. Speaker 1 
b. Speaker 2 
c. Neither 


2. Which speaker do you believe? 
a. Speaker 1 
b. Speaker 2 
c. Neither 
d. Both 


3. Which speaker is more effective? 


a. Speaker 1 
b. Speaker 2 


c. Neither 
d. Both 


4. Is there anything in particular that you like about the voice of Speaker 1? 


5. Is there anything in particular that you dislike about the voice of Speaker 1? 
6. Is there anything in particular that you like about the voice of Speaker 2? 


7. Is there anything in particular that you dislike about the voice of Speaker 2? 


This article was first published at lacus.weebly.com. 




LACUS FORUM 


Journal of the Linguistic 
Association of Canada and the 


United States 
VOLUME 41 NUMBER 4, 2020 
CONTENTS 
15. HEBREW IN HUNGARIAN: A LEXICAL COMPARISON OF SOME TEXTS 1 
Judith Rosenhouse 

16. THEORETICAL AND PRACTICAL CONSEQUENCES OF CHOMSKY’S ASSUMED 
SEPARATION OF “MENTAL DEVICES” FOR LEARNING AND UNDERSTANDING 14 
Presented at LACUS 2015 
Douglas W. Coleman 

17. HUMAN AND MACHINE TRANSLATION QUALITY: 
DEFINABLE? ACHIEVABLE? DESIRABLE? 24 
LACUS 2012 Presidential Address 
Alan K. Melby 

18. FORM VERSUS CONTENT AND INPUT VERSUS OUTPUT: THE DUAL 
BIFURCATION AND INTERDEPENDENT SYNERGY OF CONSTRUING AND SAYING 44 
LACUS 2014 Presidential Address 
H. Stephen Straight 

19. ON RHYTHM AND INTONATION IN THE SPEECH OF BARACK OBAMA 35 
Lucas van Buuren 


LACUS Forum 41:4.1-13 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free 
of charge and with no implied warranty. Its contents are available to you under the terms of the 
Creative Commons Attribution-NonCommercial license version 3.0; see 
http://creativecommons.org/licenses/by-nc/3.0/. 

Copyright © 2020 the Linguistic Association of Canada and the United States. 





HEBREW IN HUNGARIAN: 
A LEXICAL COMPARISON OF SOME TEXTS 


JUDITH ROSENHOUSE 
SWANTECH Ltd 


Abstract: Jews have been using Hebrew elements for mainly religious needs of communication in Hungary. 
However, various researchers have found about 300 words also in Hungarian daily, non-Jewish vocabulary. 
This paper compares Hebrew in Jewish Hungarian secular belletristic texts. The research questions of this 
study ask which Hebrew elements appear in such written Hungarian texts, and how the findings can be 
explained. To find some answers, I examined four books written in Hungarian: two books by each of two 
famous Jewish writers, Ferenc Molnar and Ephraim Kishon, who were born in Budapest and had Hungarian as 
their mother tongue. The study revealed differences in the occurrence of Hebrew elements in their works. 
These findings reflect different linguistic features in each of these writers’ works, which are apparently due to 
the changing historical and sociolinguistic events during the lives of Molnar and Kishon in the 20th century. 


Keywords: Hebrew language; Hungarian language; F. Molnar; E. Kishon; Sociolinguistics; Secular belletristic 
texts 


Languages: Hebrew, Hungarian 


MODERN HEBREW AND HUNGARIAN are not related languages; yet there are speakers of these 
languages who use them as L1 or L2 in Hungary, Israel or elsewhere. So far, however, very little research 
has dealt with this subject as a case of bilingualism (Rosenhouse 2015, to appear). Most of the relevant 
existing literature focuses on the historical development and cultural and literary features of Jewish 
literature in Hungary (e.g., Balla 1969, Patai 1996, Heller 1997, Szalai 2002, Sanders 2007, Kőbányai 2012). 
Few works refer to the linguistic aspects of the contact between Hebrew and Hungarian in spoken, 
non-artistic, secular, daily discourse (e.g., Vago 1980; Rosenhouse 2012a, 2012b). The present study begins the 
exploration of just this aspect of language use, not in speech but in written secular texts. Studies of this 
subject, which differs from studies of liturgical or religious literature, are not numerous, to the best of my 
knowledge, although research has found about 300 words of Hebrew origin integrated in Hungarian directly 
or through Yiddish (Blau and Lang 1941/1995, Biro 2004, Banyai & Komoróczy 2013).¹ 

¹ These studies refer mainly to the lexical origins of these items, with little reference to their phonetic, 
semantic, or other linguistic features. 

Before delving into our study, we should note that evidence from the 2nd-3rd century CE attests that Jews 
lived in Hungary, i.e., the Pannonia Basin, long before the Magyars settled there. There are tombstones 
written in Hebrew, mentioning Jewish names, some tombstones decorated with Jewish symbols such as the 
menorah, as well as coins with Jewish symbols and documents of liturgical and secular genres (Patai 1996, 
Komoróczy 2012, Ujvari 1929:352-353, Szalai 2002). Cultural documents and printed books were first 
written in Hebrew and Aramaic, and later, during most of the millennium, in Yiddish and German, and since 
the middle of the 19th century in Hungarian. Those works reflect the cultural and linguistic development of 
the Jewish communities over the centuries, revealing their flow concurrent with the historical changes in 
the region (cf. Komoróczy 2011). This literature flowered and withered during the 20th century and is 
apparently blooming again (see, e.g., Kőbányai 2012). 

In this context, it is interesting to see which Hebrew words, or their derivatives, appear in 
texts written in Hungarian, and where.² In Hungary, belletristic literature of Jewish Hungarian writers developed 
mainly since the middle of the 19th century, when most of the Hungarian Jews made great efforts to become 
fully “Hungarized” citizens after the Hungarian independence war in 1848 and the emancipation decree of 
1867 for all minorities (cf. Patai 1996, Braham 1966, Kőbányai 2012). In the 20th century, Jewish-Hungarian 
authors’ writing was thus mainly in Hungarian. Moreover, many Hungarian Jews hid their origin from 
common knowledge (as Ujvari 1929 writes) and many authors hid it in their writing (see Heller 1997, 
Kőbányai 2012: 226-227). 

As a rather pioneering linguistic study, this paper focuses on the Hebrew elements in Hungarian literary 
texts of the 20th century in a few books by two renowned Jewish-Hungarian writers, Molnar and Kishon. 
The sections below present the method of work, the findings, a discussion, and our conclusions. 


1. METHODOLOGY: COMPARING HEBREW ELEMENTS IN HUNGARIAN LITERARY BOOKS. 
The definition of Jewish languages is a frequently discussed topic in many scholarly works about Jewish texts 
written in a Jewish-indigenous (non-Hebrew/Aramaic) language. Following Bunin-Benor’s (2009) 
approach to this question, I consider the texts studied here Jewish-Hungarian. Few works, however, examine 
this question regarding Jewish Hungarian. 

Some research deals with Jewish Hungarian literature (e.g., Ujvari 1929: 393-396, Patai 1996, Kőbányai 
2012). A few papers discuss sociolinguistic and phonetic aspects of spontaneous speech of 
Hungarian-speaking Jews in Israel (Vago 1980; Rosenhouse 2012a, b). The present paper, however, focuses on works 
written by two 20th century secular Jewish Hungarian authors in Hungary and Israel. 

Our basic research questions are whether there are any Hebrew elements in texts written in Hungarian 
by Jews, and if there are, how they are expressed when they are written in different environments. 

A preliminary survey of non-liturgical literature written by Jews in Hungarian (listed in, e.g., Balla 1966, 
Kőbányai 2012 and the library of the Memorial Museum of Hungarian Speaking Jewry in Zefat, Israel) 
reveals a relatively large number of works, written in Hungary since the end of the 19th century, as well as in 
Israel (or Palestine, before 1948). From this literature, I have chosen to examine and compare two books 
written by each of two prominent Jewish Hungarian writers who shared a similar starting point, education 
and background. The first is Ferenc Molnar (originally Ferenc Neumann, 1878-1952), who immigrated to 
the USA at the beginning of World War II. The other writer is Ephraim Kishon (originally Ferenc 
Hoffmann, then Ferenc Kishont, then Ephraim Kishon, 1924-2004), who immigrated to Israel in 1949. 
Both of them were born in Budapest but left Hungary because of the anti-Semitic Nazism and Fascism that 
intensified in Hungary after World War I. From Molnar’s books I have chosen A Pál utcai fiúk ‘The Boys of 
Paul Street’ (1906), which describes conflicts between two groups of boys in Budapest. Not merely a book 
for children or teen-agers, it is considered a criticism of wars and patriotic wars in particular. It has also 
been described as ‘typically Jewish’. The other book is A testőr ‘The Guardsman’ (1912), a comic play in 
three acts printed in small format, which also served as the basis for an American film of that name in 1931. 
Kishon’s two books are humoresque collections: Hinta-Palinta ‘Seesaw’ (1956), and Humoreszk 
‘Humoresque’ (1967). Kishon won worldwide fame for his satiric style and humorous depiction of the life of 
the common people in the young State of Israel, which often leads to events ad absurdum (Bachrach 1996). 
These books first appeared in Hungarian and were later translated into Hebrew and other languages. 

² It would also be interesting to compare Hungarian texts written by Jews and Christians from this perspective, but this theme is 
beyond the goals of the present paper. 

I chose these four books, assuming that their dialogs would reflect the realistic vocabulary of daily 
discourse, and thus include Hebrew elements. Following the background described above, I assumed that 
differences would be found between Molnar and Kishon in their use of Hebrew vocabulary, and that these 
differences would reflect language-external sociolinguistic factors. I collected the Hebrew words in these four 
books, counted them, and classified them as types and tokens (occurrences) in semantic groups and 
grammatical-morphological structures (nouns, verbs, discourse markers, etc.). The findings are presented 
below. 
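
The distinction between types (distinct lexemes) and tokens (individual occurrences) can be made concrete with a small sketch. It is only an illustration under stated assumptions and not the procedure actually used in this study: the function name summarize, the category labels, and the sample occurrences are hypothetical, although the sample words themselves are taken from Kishon's vocabulary as listed in Table 2.

```python
from collections import Counter, defaultdict

def summarize(occurrences):
    """Count types and tokens per grammatical group.

    `occurrences` is an iterable of (lexeme, category) pairs, one pair for
    every occurrence (token) found in a text. A type is a distinct lexeme.
    (Hypothetical helper, for illustration only.)
    """
    tokens = Counter()          # category -> number of occurrences
    types = defaultdict(set)    # category -> set of distinct lexemes
    for lexeme, category in occurrences:
        tokens[category] += 1
        types[category].add(lexeme.lower())
    return {
        category: {"types": len(types[category]), "tokens": tokens[category]}
        for category in tokens
    }

# Hypothetical occurrences, invented for illustration; not counts from the books.
sample = [
    ("salom", "interjection"),
    ("salom", "interjection"),
    ("pakid", "noun"),
    ("kibuc", "noun"),
    ("pakid", "noun"),
]
print(summarize(sample))
```

In this invented sample the noun group has two types (pakid, kibuc) but three tokens, which is the same relation that holds between the type counts of Table 1 and the larger token totals.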


2. FINDINGS. The first and major finding is as hypothesized and expected: the two authors’ use of 
Hebrew words differs greatly. The findings are therefore reported here separately. 


2.1. HEBREW ELEMENTS IN MOLNAR’S BOOKS. The Hebrew words in Molnar’s books boil down to a 
single noun, tréfa, which appears altogether seven times, in different grammatical functions (predicate, 
postpositional phrases, and in adjunct structures). This is one of the words integrated in the Hungarian 
language (as mentioned in, e.g., Biro 2004). Here are some examples of this word in Molnar’s books: 


From The Bodyguard:³ 

1. P. 10: 
Nem,  nem,  ez    nem  tréfa 
No,   no,   this  not  joke 
‘No, no, this is not a joke’ 

2. P. 35: 
Lehet     hogy  csak  bele  ment      a    tréfába 
possible  that  only  in    went-3SG  the  joke-in 
‘It is possible that she only joined the joke’ 

From The Boys of Paul Street: 

3. 
Ha  majd  szemedbe      mennek  a    Pásztorok,  elmegy         a    kedved      a    tréfától 
If  will  eyes-your-in  go-3PL  the  Pastors     leave-3SG.FUT  the  mood-yours  the  joke-from 
‘If the two Pastors (boys’ name) appear to your eyes, your mood for jokes/joking will leave you.’ 

In addition, each of these books contains a phrase echoing Biblical heroes and events: 


³ The examples are transcribed here exactly as written in Molnar’s books. See the Appendix for some details about transcription 
issues. 


From The Bodyguard: 

4. P. 11-12: 
Hát   válj      el    tőlem.   Váljunk                   el, 
Well  separate  away  from-me  separate-1PL.subjunctive  away 
‘Well, separate from me (divorce me). Let us divorce.’ 

Mondjuk                meg       egymásnak     becsületesen  hogy  vége; 
say-1PL (subjunctive)  particle  one-to-other  sincerely     that  end-its 
‘Let us say to each other sincerely that it’s the end;’ 

te   megy  jobbra    én  megyek  balra 
you  go    right-to  I   go      left-to 
‘you go to the right and I go to the left.’ 

‘Well, divorce me. Let us divorce. Let us say to each other sincerely that it’s the end: you go to the 
right and I go to the left.’ (Cf. Genesis, 13:9 - the dialog between Abraham and Lot before parting 
from each other.) 


From The Boys of Paul Street: 


5. P. 17D: 
Ahol   Nemecsek  leterítette  Áts  Ferit,    mint  hajdanta        Dávid  Góliátot 
Where  Nemecsek  down-struck  Áts  Feri-ACC  like  in-olden-times  David  Goliath-ACC 

‘Where Nemecsek struck Feri Ats down, as in olden times David (did to) Goliath.’ (Cf. Samuel I, 
17: 37-52, the story of Goliath’s defeat by David.) 


These examples show the following features: 

In principle, the word tréfa refers to an impure, inedible animal, according to Jewish dietary rules, but 
in General Hungarian it means a ‘joke, jest, pleasantry, fun’ (Orszagh 1973: 1057). This word can also take 
verb and adjective suffixes, like other borrowed (foreign) nouns; e.g., tréfézni (Orszagh 1973: 1057), 
tréfálkozni ‘to jest, to speak lightly,’ tréfás ‘amusing, funny, droll’ (Szalai 2011: 437, Orszagh 1973: 1057). 

The original Hebrew meaning of impure animals has ended up in Hungarian as tréfli (Szalai 2011: 437), 
where -li (which occurs also in other words) is suffixed to the original word. Thus, the original lexeme has 
semantically and morphologically split into two forms for secular, non-Jewish connotations and for a 
religious-Jewish meaning. 

In addition, in each of these books, Molnar uses a phrase and personal names, regarding two famous 
figures of the Hebrew Bible. These devices probably reveal Molnar’s trust that both Jewish and Christian 
audience/readers will recognize the Biblical origin; simultaneously, they also reveal his own cultural links 
to the Biblical source. 


2.2. THE FINDINGS IN KISHON’S BOOKS. The number and variety of Hebrew words in Kishon’s books 
are substantially larger than in Molnar’s books. Moreover, there are differences between Kishon’s two books: 
the earlier book Seesaw (1956) contains many more Hebrew words than the later book, Humoresque (1967).⁴ 
Table 1 gives an idea of the Hebrew lexical elements (types) in these books. Note that Table 1 does not 
present the total number of Hebrew words (tokens) in the books, which is much larger. Some examples of 
frequent words appear in Table 2.⁵ 


Table 1. Hebrew lexeme types in Kishon’s Hinta-Palinta and Humoreszk 


Word- Nouns & Verbs Discourse  Interrogative | Adjectives Expressions, Collocations, 
group personal markers particles & incl. Biblical 
Book TAINS clauses 
Hinta- 150 (inc. 8 6 Dy, 4 4 5 
Palinta personal + 
ls place 
names) 
Humoreszk 24 (nouns 2 Z 0 4 9 
and 66 
personal 
names) 


Table 2. Hebrew words in Kishon’s transcription, with English translations 


Hungarian transcription    Hebrew          Translation 

salom                      שלום            hello 
rega                       רגע             a moment 
szlicha                    סליחה           pardon, excuse me 
seket                      שקט             quiet, silence 
Adon / adoni               אדון / אדוני    Mr. 
pakid                      פקיד            a clerk 
pruta                      פרוטה           Israeli coin of low value (not used any more) 
memsala                    ממשלה           government 
kibuc                      קיבוץ           kibbutz, kind of settlement 
tov                        טוב             good, ok 
lo                         לא              no 
boecem                     בעצם            actually 
boszeder                   בסדר            all right 
bovakasa                   בבקשה           please 
bachura                    בחורה           girl, young woman 
bachurim                   בחורים          young men 
kupat cholim               קופת חולים      health insurance association 
ribono sel olam            ריבונו של עולם  God; literally: ruler of world 


The Hebrew words in these books differ in their types and linguistic roles, though syntactically they 
function anywhere in the sentence as subjects, nominal and verbal predicates, complements, etc. Most of 
the Hebrew words in Kishon’s books are nouns, which appear in numerous morphological patterns and 
syntactic roles. They include animate and inanimate nouns and proper names. The latter are given names, 
surnames, and protagonists’ names coined by Kishon, as well as animal names (e.g., Kishon’s pet dog) and 
institute names (e.g., health insurance). Toponyms of towns, streets (mainly from the Tel-Aviv area and the 
center of the country) and Israeli regions occur, too. Kishon additionally presents coined humorous 
personal names (which do not serve as proper names outside his works). Some of these words are common 
nouns (but not proper nouns) of conventional Hebrew, e.g., (Mr.) Savuatov ‘good week’, (Mrs.) Kalanijot 
‘anemones’, while others mix German or Hungarian with Hebrew, as (Mrs.) Ztiszdvas Tradgar, which is 
literally ‘sweet’ (German) - ‘honey’ (Hebrew) - ‘abominable’ (Hungarian). Kishon’s creativity is evident in 
additional ways; see, e.g., tórlódás, which combines Hebrew for ‘line, queue’ with Hungarian tolódás ‘butting 
in, pushing’ and Hungarian torlódás ‘heaping, piling up, congestion’ (Orszagh 1973: 1048). 

⁴ The major differences between the Hebrew words in Kishon’s two books are that Seesaw has more Hebrew words 
and that they include more echoes of Biblical and later Jewish literature than Humoresque. 
⁵ The examples are transcribed exactly as written in Kishon’s books. 

Phrases combining Hungarian and Hebrew words (in a Hebrew or Hungarian morphological or 
syntactic structure) are, e.g., tízprutás érme ‘a ten-pruta coin’ (Humoreszk, p. 200), where pruta is the name 
of an Israeli coin of low value, inherited from old Hebrew.⁶ Hebrew elements can also be embedded in verb 
form by adding the proper suffix, e.g., Ne hochmecolj ‘don't be a wise guy, don’t crack jokes’ (Humoreszk, 
p. 270), < Yiddish boxme ‘wisdom’, cf. Modern Hebrew boxma ‘wisdom, wit’, preceding the Hungarian 
verb-creating suffix.⁷ Such combinations abound in code switching (e.g., Myers-Scotton 1993, Muysken 2000). 

Other Hebrew elements occur in loan translations (calques), metaphors and collocations, though less 
frequently than free lexemes. Calques may reflect the literal translation of the Hebrew word (Jehuda 
oroszlánok = /arajot jehuda/, ‘Lions of Judas’⁸). Kishon also uses various formulas and patterns of traditional 
parables or sayings, for various connotations, e.g.: 


6. Humoreszk, p. 124 
És   az   Úr   szelleme       lebeg        a    vizek   felett 
And  the  God  spirit-of-3SG  is-hovering  the  waters  above 

‘And the Spirit of God was hovering over the face of the waters.’ 
(Genesis, 1:2, English Standard Version).⁹ 


7. Humoreszk, p. 315 
Az   Úr    adta,  az   Úr    elvette 
The  Lord  gave,  the  Lord  away-took 
‘God has given, God has taken away’ 


The Biblical saying (Job, 1: 21) “God has given, God has taken away’ usually describes a loss, mainly a 
person's death, but in Kishon’s Humoresque it describes the problem of forgetting a street name. Further, 
Kishon humorously imitates (in calque) Israeli speakers’ demagogic style of the 1950s, e.g.: 


⁶ Since the book first appeared, pruta has gone out of use in Israel, but the word itself is still sometimes used meaning “a low 
valued coin.” 

⁷ Modern Hebrew expresses this meaning in a single verb form: hithakem ‘he was being a wise guy’. 

⁸ This is Kishon’s invented name of the imaginary Israeli ice-hockey team, with a humorous allusion to Jacob’s son, Yehuda, 
described in the Bible (Genesis 49:9) as a lion cub. 

⁹ This Biblical verse (Genesis, 1:2) which appears in Humoreszk, p. 124, humorously depicts Kishon’s balcony flooded by 
rainwater. 


8. Humoreszk, p. 33 
ezen               a    történelmi  estén 
this-INDIRECT-ACC  the  historical  evening 
‘On this historical evening’. 


A rare feature is his humorous simulation of a new immigrant’s Hebrew use of the infinitive jönni ‘to come’ 
instead of the (grammatically correct) inflected verb, thus: 


9. Humoreszk, p. 149 
Én  csak  most  jönni    Romániából 
I   just  now   to-come  Romania-from 
‘I have just come from Romania’. 


Finally, there are two cases of words written in the Hebrew alphabet in Humoreszk. One is Joseph 
the-end (is)-not, which is the axis around which the humoresque revolves:¹⁰ 


10. Humoreszk, p. 263 
Josef   ha-sof   einenu 
Joseph  the-end  not-3SG 
‘Joseph the-end (is)-not’. 


In addition to nouns, verbs occur in all tenses and moods, but much less often than nouns. They appear 
in collocations or set phrases (e.g., smá ‘listen, m. sg.’, tiszlách li ‘excuse me, m. sg.’). Syntactically many 
toponyms take the suffix -i which makes them adjectives (e.g., Tel-Avivi ‘belonging to Tel-Aviv’) in both 
Hebrew and Hungarian. 

Semantically/pragmatically, the Hebrew words include interjections (e.g., rega ‘one moment!’, mázál
tov ‘congratulations!’), addresses (without/with a bound pronoun, as adon/adoni ‘Mr.’), request formulas
(bo vákásá ‘please’) and thanks (todá rábá ‘thank you very much’). Negations (lo ‘no, negating a verb form’,
eyn ‘not, negating a nominal phrase’), assertions (betach ‘sure’), and discourse markers (boecem ‘in fact’),
etc. also occur. The Hebrew words in Kishon’s books reflect various lexico-semantic groups, as presented
in Table 3.


Table 3. Examples of lexico-semantic groups of Hebrew words in Kishon’s books 


Elements of Jewish culture and     Modern Israeli terms                 Interjections for invoking help,
traditions relating to holidays                                         good wishes, congratulations

sabat ‘Sabbath’                    social structures (kibbutz)          Ribono Sel Olám ‘O God’, lit.: ‘The
                                                                        Ruler of (the) World’
ros hasana ‘New Year holiday’      governance (memsala), order and      mazal tov ‘Congratulations! Good
                                   bureaucracy (pakid)                  luck!’
kapeli ‘skullcap’                  popular dishes (falafel ‘falafel’,
                                   humusz ‘hummus’, uga ‘cake’)
Biblical quotations (exs. 6, 7
above)


¹⁰ The original phrase was literally “Histadrut place in the frame but Yitshaq the end is not”, meaning “put ‘The Union’
in the frame, but Yitshaq, the end (of the report) is missing”, which ended up as “Yitshaq, the end (of his life has come. He)
is not (with us anymore).” Missing punctuation symbols caused the blunder. Please note that in the Hebrew version of this
humoresque, the name is Joseph, not Yitshaq.




3. DISCUSSION. This paper has examined the Hebrew vocabulary in two books by Molnar and Kishon,
two world-renowned Jewish-Hungarian authors. In Molnar’s books we found only a few occurrences of a
single word, ‘tréfa’, now a loanword from Hebrew in normative Hungarian and indistinct from original
Hungarian lexemes. It appears in several grammatical roles and shows a semantic difference between the
original Jewish meaning and the general Hungarian meaning. In addition, a single allusion to a Biblical event
occurs in each of Molnar’s books.

In contrast, Kishon’s books comprise numerous Hebrew words, some of which are from the Biblical
vocabulary (Classical Hebrew). Others belong to later (Rabbinic) and Modern Hebrew vocabulary,
including newly coined words (böszeder ‘all right’, szlichá ‘excuse me’, kupát cholim ‘health insurance
association’) as well as Biblical words with new meanings or connotations compared to those in the Bible
(e.g., memsala ‘government’). These lexical devices characterize Kishon’s Hungarian-Hebrew style or
idiolect, which may be considered Modern Jewish-Hungarian.

In line with the research assumptions, Molnar’s and Kishon’s attitudes to Hebrew differ. This difference
can be attributed to sociolinguistic factors. Historically, when Molnar wrote his books before WWI, erets
Yisra’el ‘the Land of Israel’ was under the rule of the Ottoman Empire, and the State of Israel was still a dream.
European Zionism was taking its first steps, contending against other Jewish religious, political and cultural
factions. Most of the Jews for whom Hebrew was important were Orthodox or Hassidic. For them, Hebrew
was a sacred language, not one for daily communication in much of Europe. The literate elite, who published
secular prose and poetry in Hebrew magazines, newspapers and books, and tried to disseminate its use for
cultural goals, lived mostly in European lands ruled by the tsar (Poland, Russia etc.), rather than in Hungary
or Western Europe.

At the same time, Hebrew was taking its first steps as a modern language in the Land of Israel. Scholars
and educators (mostly immigrants from Eastern Europe and Middle Eastern countries) were busy adapting it
to modern life. Adults actively invented and coined new terms and lexical items out of ideological motives,
in addition to borrowing modern terminology from a few European languages. Their children acquired
spoken Hebrew mainly from their peers and teachers at kindergartens and grade school (and less from their
parents), and gradually developed the language to its current status as a native language (cf. Bar-Adon 1977;
Haramati 1979; Blau 1981; Izre’el 2003). In Hungary, on the other hand, most of the Jews kept Yiddish as
their language of daily communication until almost the end of the 19th century (cf. Szabó 2002). The
dominant languages of the country were German, the official Empire language, and Hungarian, as Hungary
was then part of the Austro-Hungarian Empire. After the Emperor, Franz-Josef I, issued the decree for civil
rights to all the minorities in the Austro-Hungarian Empire in 1867, the Jews quickly shifted to German
and/or Hungarian. At the beginning of the 20th century, most of the Hungarian Jews already used Hungarian
as their main spoken and written language. Many of them distanced themselves from their Jewish culture,
religion and identity, including the Hebrew language (Patai 1996; Katzburg 1966; Ujvari 1929: 353). Molnar
was one of the literate Jewish elite; as a child and teenager, he did not study at Jewish schools. He wrote in
Hungarian and did not emphasize his Jewish origin. Yet, following the increased anti-Semitism in Hungary
(and in many parts of Europe), he began to spend extended periods in France and Switzerland, and finally
immigrated to the USA.

Judaism and Hebrew had a different meaning for Kishon, who grew up in Budapest in an increasingly
anti-Semitic atmosphere and survived World War II and the Holocaust period by sheer luck (London 2005).
After those terrible years, he immigrated to Israel shortly after its establishment, and made it his home as a
non-religious Zionist (London 2005). Soon realizing that he needed to know the local language to be able to
make a living in Israel, he acquired Hebrew exceedingly quickly, began working as a journalist, and
published humoresques in newspapers. He first wrote for the Hungarian Israeli newspaper Uj Kelet ‘New
East’, and later, in Hebrew, in ‘omer ‘Saying’, the beginners’ Hebrew newspaper, and Maariv, the most
popular Israeli daily newspaper at that time. He used Hebrew in his works and even coined many
Hebrew words and expressions that have enriched the Hebrew vocabulary. Thus, for Kishon, Hebrew was
not a distant, exotic, historical, liturgical language; it became part of his daily linguistic makeup and skills,
part of his personality and identity. The motivation, manner of use and scope of the Hebrew elements in
Kishon’s literary works differ from those in Molnar’s.

As mentioned, approximately 300 words of Hebrew origin penetrated Hungarian through the centuries,
both into its slang and into the normative register (Biro 2004, Blau & Lang 1941/1995). Many of these words
reflect Jewish traditions, holidays and habits and do not enable free conversation in all the (modern)
discourse topics. In addition, the grammar in Jewish-Hungarian texts remained Hungarian. Thus, the
Hebrew words were adapted to Hungarian phonology, morphology and syntax. This description reflects a
long process of Hebrew code-switched words in Jewish and non-Jewish speakers’ discourse before their
total absorption in Hungarian.

In contrast with Molnar, the words in Kishon’s books and their semantic fields reveal the vibrant life of
Modern Hebrew and its variety in daily communication. These words reflect the widespread cultural
elements and notions in Israeli society, mainly in Tel-Aviv and its surroundings. These Hebrew words
reflect part of the new vocabulary that immigrants (including Hungarian speakers) had to acquire in the
country. Most of these Hebrew words are very frequent in daily communication and enable Kishon’s
Hungarian readers to learn them in a humorous context. (Kishon often explains them through the
context.)


CONCLUSION. To conclude, we have been able to answer our two research questions about Hebrew
elements in written Hungarian texts. We have found many diverse examples of Hebrew words in the
20th-century literature written in Israel, but hardly any in Molnar’s texts from Hungary, with half a century
separating them. We have also suggested a sociolinguistic explanation of the findings, i.e., the social and
political circumstances that changed during the century, including the Holocaust on the one hand, and the
establishment of Israel and development of the Hebrew language on the other.

However, it has also been found that Jewish authors tended to “de-Judificate” their literature (Heller
1997) because of increasing anti-Semitism. This tendency is another result of the Jews’ efforts to become full
Hungarian citizens. The infrequent use of Hebrew elements in Molnar’s books apparently reflects this trend.

The differences between the books by Kishon and Molnar are seen in both the types of words and their 
distribution. In these works, the Hebrew elements in Molnar’s books reflect Biblical/traditional Hebrew, 
whereas Kishon’s books use the daily vocabulary of Modern Hebrew, which encompasses much more than 
Biblical vocabulary. 

The literature of Jewish languages examines various genres of writing in many languages and periods.
Each genre, in its time, had (and has) its goals, style, method and circulation. Until the end of the 19th
century, Jewish literature in Hungary was written in Yiddish, Aramaic, Hebrew, German, and some
Hungarian. It comprised official and liturgical texts, as well as (fewer) secular texts such as Hebrew language
teaching textbooks and children’s stories (cf. Komoróczy 2011). In the 20th century, secular Jewish literature
in Hungary expanded to more prose (including journalism), poetry and drama in Hungarian. However,
during the 19th and 20th centuries (before and after the Holocaust), literary prose and poetry in Hebrew were
also written by Jewish-Hungarian authors in Hungary and Israel (e.g., Carmilly-Weinberger 1966,
Rosenbaum 1999).

For future research, it may be suggested that despite the small number of Hebrew elements in Molnar’s
two books studied here, his other books, and books by other Jewish Hungarian authors, may comprise
echoes of their Jewish cultural background, in vocabulary, motifs or style (cf. Heller 1997). It may also be
that the Jewish-Hungarian literature written in Israel and Hungary after the 1960s is already somewhat
influenced by Modern Israeli Hebrew (Sanders 2004). Further research on Hungarian literature written by
Jewish (and non-Jewish) authors has a wide scope.


REFERENCES 


BACHRACH, NATHAN. 1996. Immigrating and absorbing in the 1950s in light of Efraim Kishon’s works
(Olim ve-Qoltim bi-Shnot ha-50 bi-Re’i Yetsirato shel E. Kishon), ‘Alon La-More Le-Sifrut (Bulletin
for Literature Teachers) 16, Jerusalem: Ministry of Education and Culture, accessed on 15.4.2015
from: http://retro.education.gov.il/tochniyot_limudim/sifrut/asil6017.htm (in Hebrew).

BALLA, ERZSEBET. 1969. The Jews of Hungary: A cultural overview. In Hungarian Jewish Studies, ed. by 
Randolph L. Braham. New York: World Federation of Hungarian Jews, 85-136. 

BANYAI, VIKTORIA & SZONJA RAHEL KOMOROCZY. 2013. Hungarian, Hebrew Loanwords in. In 
Encyclopedia of Hebrew Language and Linguistics, ed. by Geoffrey Khan. Leiden: Brill, Vol. 2, 217-18. 

BAR-ADON, AHARON. 1977. On the nativization of Modern Hebrew and the role of children in this
process. In Studies in Descriptive and Historical Linguistics: Festschrift for Winfred P. Lehmann, ed. by
Paul J. Hopper. Amsterdam: Benjamins, 487-498.

BENOR, SARAH BUNIN. 2009. Do American Jews Speak a “Jewish Language”? A Model of Jewish Linguistic
Distinctiveness, The Jewish Quarterly Review, Herbert D. Katz Center for Advanced Judaic Studies: 230-
269. Also: http://www.bjpa.org/Publications/details.cfm?PublicationID=4797


BIBLE, ENGLISH STANDARD VERSION http://biblehub.com/esv/genesis/1.htm, accessed on June p7th, 
2015. 

BIRO, TAMAS. 2004. Weak Interactions: Yiddish Influence in Hungarian, Esperanto and Modern
Hebrew. In On the Boundaries of Phonology and Phonetics: A Festschrift Presented to Tjeerd de Graaf,
ed. by Dicky Gilbers, Maartje Schreuder, & Nienke Knevel. Groningen: University of Groningen, 123-45.

BLAU, HENRIK & LANG, KAROLY. 1941 (1995, 2nd ed.). Szájról-szájra, Magyar-Jiddis Szógyűjtemény (From
Mouth to Mouth. Hungarian-Yiddish Word Collection). Pápa (in Hungarian and Yiddish).

BLAU, JOSHUA. 1981. The Renaissance of Modern Hebrew and Modern Standard Arabic: Parallels and
Differences in the Revival of Two Semitic Languages. California, USA: University of California Press.

BRAHAM, RANDOLPH L. (ED.) 1969. Hungarian Jewish Studies. New York: World Federation of Hungarian
Jews.

CARMILLY-WEINBERGER, MOSHE. 1966. Hebrew poetry in Hungary. In Randolph L. Braham (ed.) 
Hungarian-Jewish Studies, New York: World Federation of Hungarian Jews, 295-342. 

HARAMATI, SHLOMO. 1979. The Beginning of Hebrew Education in the Country and its Contribution to the
Language Revival, 1883-1914 (Reshit ha-chinuch ha-ivri ba-arec u-trumato le-hachya’at ha-lashon, 5643-
5674). Jerusalem: Rubin Mass (in Hebrew).

HELLER, AGNES. 1997. A zsidótlanítás a magyar zsidó irodalomban (Dejudification in the Jewish
Hungarian literature). In A Határ és a Határolt: Töprengések a Magyar-Zsidó Irodalom Létformáiról
(Boundary and the Bounded: Broodings about the Hungarian-Jewish Literature Formations), ed. by
Török Petra. Budapest: Országos Rabbiképző Int. Yahalom Zsidó Művelődéstörténeti
Kutatócsoportja (in Hungarian).

IZRE’EL, SHLOMO. 2003. The Emergence of Spoken Israeli Hebrew. In Corpus Linguistics and Modern
Hebrew: Towards the Compilation of the Corpus of Spoken Israeli Hebrew (CoSIH), ed. by Benjamin H.
Hary. Tel Aviv: Tel-Aviv University, The Chaim Rosenberg School of Jewish Studies, 85-104.

KATZBURG, NATHANIEL. 1966. Hungarian Jewry in Modern Times: Political and Social Aspects. In
Hungarian Jewish Studies, ed. by Randolph L. Braham. New York: World Federation of Hungarian Jews,
157-70.

KISHONT, FERENC. 1956. Hinta-Palinta - Humoreszkek (See-saw - Humoresques). Tel-Aviv: Forum (in
Hungarian).

KISHONT, FERENC (EFRAIM KISHON). 1967. Humoreszk (Humoresque). Tel-Aviv: Tversky (in
Hungarian).

KOBANYAI, JANOS. 2012. A Magyar-Zsidó Irodalom Története: Kivirágzás és Kiszántás (The Hungarian Jewry
Literary History. Blooming and Upheaval). Budapest: Múlt és Jövő (in Hungarian).

KOMOROCZY, GEZA. 2012. Jewish History in Hungary. Budapest: Pesti Kalligram Kft. 2 vols. (in
Hungarian).

KOMOROCZY, SZONJA RAHEL. 2011. Yiddish Printing in Hungary: An Annotated Bibliography. Budapest:
Center for Jewish Studies of the Hungarian Academy of Sciences.

LONDON, YARON. 2012. Kishon - A Biographical Dialog, Tel-Aviv: Ma‘ariv - Hed Artzi (in Hebrew).

MYERS-SCOTTON, CAROL. 1993. Duelling Languages. Grammatical Structure in Codeswitching. Oxford:
Clarendon Press.

MOLNAR, FERENC. 1912. A Testőr (The Guardsman). Budapest: Franklin-Társulat (in Hungarian).

----, 1906/1955. Pál Utcai Fiúk (The Boys of Paul Street). Budapest: Móra Ferenc Ifjúsági Könyvkiadó (in
Hungarian).

MUYSKEN, PIETER. 2000. Bilingual Speech: A Typology of Code-Mixing, Cambridge: Cambridge University
Press.

ORSZAGH, LAJOS (1973°) Magyar-Angol Szótár (Hungarian-English Dictionary), Budapest: Akadémiai
Kiadó (in Hungarian and English).

PATAI, RAPHAEL. 1996. The Jews of Hungary: History, Culture, Psychology. Detroit: Wayne State 
University Press. 

ROSENBAUM, GABRIEL MOSHE. 1999. The short novel and the pen: Popular Hungarian literature -
from Budapest to Tel-Aviv, Kesher, 26 (November) (in Hebrew). (A shorter version of the paper,
accessed 10.3.2015, is http://baderech.hjm.org.il/Article.aspx/He/TheRoman)

ROSENHOUSE, JUDITH. 2012a. Israeli Hebrew and Hungarian interaction:
Phonetic/phonological issues. In Rhythm, Melody and Harmony in Speech: Studies in Honor of
Wiktor Jassem, ed. by Dafydd Gibbon, Daniel Hirst, & Nick Campbell, Speech and Language
Technology, 14/15: 211-224.

-- 2012b. Two generations of Hungarian and Hebrew in Israel: A sociolinguistic study, Helgat Lashon, 45:
159-183 (in Hebrew).

-- 2015. Jewish-Hungarian. In Handbook of Jewish Languages, ed. by Aaron Rubin & Lily Kahn, 226-233.
Leiden: Brill.

-- 2018. Jewish Hungarian: Past and present. In The Jewish Languages: An International Handbook, ed. by
Benjamin Hary & Sarah Benor. Berlin: De Gruyter Mouton.

SANDERS, IVAN. 2004. Jewish themes and issues in post-1989 Hungarian literature, Hungarian Studies 
18(2): 213-222. Accessed on 8.11.2015 from: http://www.epa.hu/01400/01462/00032/pdf/213- 222.pdf 

SZABO, VERA. 2002. Yiddish in Hungary. In In the Land of Hagar: The Jews of Hungary: History, Society
and Culture, ed. by Anna Szalai, Tel-Aviv: Beth Hatefutsoth, The Nahum Goldman Museum of the
Jewish Diaspora, Ministry of Defense Publishing House, 204-205.

SZALAI, ANNA. 2002. Jewish authors and Jewish themes in Hungarian literature from the 1840s to the
Holocaust. In In the Land of Hagar: The Jews of Hungary: History, Society and Culture, ed. by Anna Szalai,
Tel-Aviv: Beth Hatefutsoth, Nahum Goldman Museum of the Jewish Diaspora, Ministry of Defence
Publishing House, 227-236.

SZALAI, ANNA. 2011. Hebrew-Hungarian, Hungarian-Hebrew Dictionary, Jerusalem: S. Zack Könyvkiadó (in
Hebrew and Hungarian).

UJVARI, PETER (ED.). 1929. Magyar Zsidó Lexikon (Hungarian Jewish lexicon), Budapest: Magyar Zsidó
Lexikon Kiadása (in Hungarian).

VAGO, ROBERT M. 1980. Paradigmatic Regularity in First Language Attrition. In First Language 
Attrition, ed. by William Seliger and Robert M. Vago, 241-51. Cambridge: Cambridge University Press. 




Appendix. The phonetic symbols used in this paper 


In this paper, we would need phonetic symbols for both the Hebrew and the Hungarian systems, as they are 
not entirely identical. 

However, the Hungarian examples appear in this paper exactly as they appear in Molnar’s and in Kishon’s 
books. Therefore, they are not in any strictly phonetic system. For the consonants of both these languages 
Kishon uses the Hungarian letters and letter combinations c (for /c/), s (for /š/), sz (for /s/), cs (for /č/), ny
(for /ɲ/). Kishon’s texts also include words of Hebrew origin integrated in Jewish Hungarian via Yiddish
(e.g., coresz ‘troubles, problems’, cf. Modern Hebrew /carot/).

For the Hebrew consonants which are not part of Hungarian, he uses the German ‘ch’ (for Modern Hebrew
/x/ and /ḥ/). Modern Hebrew has, however, nearly discarded several consonantal features, mainly
glottalization and pharyngealization. These consonants (phonemes) are still represented in general normative
writing, and they still occur in the speech of some Modern Hebrew speakers, usually those of the
Sephardic communities. But Kishon does not transcribe these phonemes with the pharyngeal and
pharyngealized feature. Thus, he uses the letters “t, k”, etc. also for such consonants (e.g. /talit/ ‘prayer
shawl’). Neither does he distinguish between pharyngeal /ʕ/ and glottal /ʔ/. Like many Israeli native speakers,
he omits the latter two altogether. See examples in Table 4.
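
The consonant correspondences described above can be illustrated with a minimal sketch in Python. It is only an illustration under stated assumptions, not a procedure used in this study: the names CONSONANT_MAP and to_phonemic are hypothetical, vowels and accent marks are passed through unchanged, and ‘ch’ is mapped to /x/ only, following the Modern Hebrew pronunciation.

```python
# Hypothetical illustration of the spelling-to-phoneme correspondences
# described in the appendix; the names below are not from the paper.

# Two-letter combinations are tried before single letters,
# so that "sz" is not misread as "s" followed by "z".
CONSONANT_MAP = {
    "sz": "s",  # Hungarian sz = /s/
    "cs": "č",  # Hungarian cs = /č/
    "ny": "ɲ",  # Hungarian ny = palatal nasal
    "ch": "x",  # German-style ch, here rendered as Modern Hebrew /x/
    "s": "š",   # Hungarian s = /š/
    "c": "c",   # Hungarian c = /c/
}


def to_phonemic(spelling: str) -> str:
    """Convert a Hungarian-style spelling left to right, longest match first."""
    word = spelling.lower()
    out = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in CONSONANT_MAP:
            out.append(CONSONANT_MAP[pair])
            i += 2
        elif word[i] in CONSONANT_MAP:
            out.append(CONSONANT_MAP[word[i]])
            i += 1
        else:
            # Vowels and all other letters are copied unchanged.
            out.append(word[i])
            i += 1
    return "".join(out)
```

For example, to_phonemic("szlicha") returns "slixa", because ‘sz’ is read as /s/ and ‘ch’ as /x/. Because the sketch reads ‘ny’ as a palatal nasal, its output for a word such as kibucnyik need not coincide with the Modern Hebrew form given in Table 4.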


Kishon’s transcription reflects Modern Hebrew pronunciation also in omitting the pharyngeal /ʕ/ and
glottal /ʔ/. See examples in Table 4.

For the vowels, Kishon adds Hungarian umlaut and accent marks as used in Hungarian orthography, e.g.,
ö, á, é (Table 5). These added signs (also for Hungarian words), however, do not always occur where
expected in his books, probably due to typographical errors.


Table 4. Transcription of Hebrew words in Kishon’s Hungarian texts 


Hungarian transcription    Modern Hebrew               Translation
sálom                      /šalom/                     ‘hello’ (lit.: peace)
szlichá                    /slixa/                     ‘pardon’
sálos vacheci              /šaloš va-xeci/             ‘three and a half’
ádoni                      /ʔadoni/                    ‘Mr.’
macil                      /macil/                     ‘lifeguard (in a pool)’
kibucnyik                  /kibucnik/                  ‘a kibbutz member’
givát rámbám               /givʕat~givat rambam/       ‘Rambam Hill’ (named after Maimonides)
benádám                    /ben-ʔadam/                 ‘human being’ (lit.: ‘Adam’s son’)

Table 5. Hungarian vowels in Hebrew words in Kishon’s books 


Hungarian transcription    Modern Hebrew    Translation
böszeder                   /beseder/        ‘all right’
todá                       /toda/           ‘thanks, a thanking’
brécha                     /brexa/          ‘(swimming) pool’


This article was first published at lacus.weebly.com. 





LACUS Forum 41:4.14-23 (2020) 






LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 


THEORETICAL AND PRACTICAL CONSEQUENCES OF CHOMSKY’S 
ASSUMED SEPARATION OF “MENTAL DEVICES” FOR 
LEARNING AND UNDERSTANDING¹
DOUGLAS W. COLEMAN 
University of Toledo 


Abstract: Chomsky (1964:26) assumed the existence of two separate “mental devices” for language
understanding and language learning. His assumption was later shared by many others. This paper
shows that the assumption of separateness has failed, on both theoretical and empirical grounds.
Patches to the theory, notably the comprehensibility criterion added by Krashen
(1982:21), have also ultimately failed as a result of being self-contradictory. This paper points out
that Chomsky’s original assumption was due to his prior assumption that associative learning
processes are irrelevant to language learning. A third approach, one based on theories of associative
learning, actually shows the most promise.


Keywords: language learning, input, SLA, human linguistics, hard-science linguistics 

Languages: English 
CHOMSKY (1964:26) MADE THE ASSUMPTION that there are two separate “mental devices” (his term)
for understanding and learning (Figure 1). Although not all theoretical linguistic frameworks make a
similar assumption, his has been extremely influential, especially within the field of Second Language
Acquisition (SLA).


(a) utterance → [A] → structural description

(b) primary linguistic data → [B] → generative grammar


Figure 1. Chomsky’s (1964:26) assumed devices for (a) language understanding and (b)
language learning.²





¹ Based on a paper presented at LACUS Forum XLII in August 2015.

² Chomsky sometimes refers to “language learning” and at other times to “language acquisition.” See, e.g., Chomsky 1964:25-26, where
he refers to “language ... acquisition” and “acquisition of language,” to Chomsky 1966, where he refers to “acquisition ... of language,”
and to Chomsky 1986:26, where he refers to “acquiring” language; then compare Chomsky 1964:24 and Chomsky 1986:25, where he
refers to what is “learned.” In other words, he has not over the years made the distinction that some, most notably Krashen (1985,
2003:passim), have made between “learning” as a conscious process vs. “acquisition” as an unconscious process.




It is very important to note that Chomsky created the two mental devices — and thus the distinction 
between them — by assumption. His assumptions have significant negative theoretical and practical 
consequences, as is shown below. But first, I'll begin by describing Chomsky’s two assumed devices 
themselves. 

Simply put, device [A] is a mental device, purportedly housed in the brain, which permits a hearer to
understand language. It does this by assigning a structural description to an utterance. The utterance is
generally assumed to be speech, which is processed into a phonetic form and then parsed. Chomsky may
have waffled on the nature of the input to [A] over the years, adding to it not just speech, but also
nonlinguistic contextual factors; see for example what Sharwood Smith has to say, below.

Device [B] is a separate mental device, also supposedly housed in the brain, but functionally
independent from [A], which permits a hearer to learn language. In Chomsky’s first version of his theory
(1964), device [B] did this by performing operations on primary linguistic data, which Chomsky described
as well-formed sentences in the grammar of the language to be learned, in order to construct a generative
grammar. It is important to note that according to Chomsky (1964, 1986, just for example), the primary
linguistic data does not include anything but well-formed sentences in the language to be learned (see the
clarification made by Sharwood Smith, discussed below). After Gold’s (1967) paper appeared, Chomsky
implicitly accepted his proof that there could exist no algorithm for [B] that could take primary linguistic
data as input and produce a generative grammar as output. Chomsky (1968:24)³ patched Universal
Grammar (UG) onto his theory so that [B] merely had to set parameters in an already fixed system, instead
of actually having to formulate a rule system.

A key point, for now, is that Chomsky has all along asserted that [A] and [B] are independent mental 
devices. His descriptions clearly assume that the language learning device does not utilize functions of the 
language understanding device and vice versa. This separation has been emphasized over the years by many 
who accept his assumptions concerning UG, including, just for example, Beebe (1985:404); Sharwood Smith 
(1985:402); White (1989:passim); Saleemi (1992:8); Cook (1993:15); various authors in Flynn, 
Martohardjono, & O’Neil (1998:passim); and Gass & Selinker (2001:401). (This really is just a small 
sampling.) As Sharwood Smith (1985:402, fn. 1) explains, 


It is important to distinguish input to the comprehension system where the listener / reader
employs a whole network of linguistic and non-linguistic knowledge sources, and input to the
language acquisition, i.e., “grammar constructing” device, which will not be coextensive with the
first type of input (cf. Sharwood Smith 1981).


> There may be a paper or presentation earlier than this one in which Chomsky also introduced Universal Grammar, but Language 
and Mind (1968) is the earliest in which I have been able to find it. It does appear that no explicit reference to UG can be found in 
Chomsky’s work prior to Gold’s paper, bolstering the conclusion that the introduction of UG was a response to Gold’s unlearnability 
proof. We have to infer this, because, apparently, nowhere in his work does Chomsky explicitly state the reason for introducing 
the theory of UG. Further, in Chomsky (1968:24), he says, “The principles that determine the form of grammar and that select
[emphasis mine] a grammar of the appropriate form on the basis of certain data... might... be termed ‘universal grammar’.” Contrast
his use of the term “select” in 1968 to the use of “construct” or “generate” prior to the appearance of Gold (1967). Further, in later work,
Chomsky merely refers to UG as a given, as in Chomsky (1986: passim). Although there is widespread confusion as to the origin of
Chomsky’s theory of UG, it is clear that it is not based on Greenberg's theories of language universals; the two theories have distinct,
even contradictory elements. As Evans & Levinson (2009:430) put it, however, “Chomsky’s notion of Universal Grammar (UG) has
been mistaken, not for what it is – namely the programmatic label for whatever it turns out to be that all children bring to learning a
language – but for a set of substantial research findings about what all languages have in common.” At most, UG piggybacked on the
late-1960s popularity of Greenbergian ideas of language universals; it did not in any way adopt those ideas as part of its core.




1. THEORETICAL COUNTERARGUMENT TO CHOMSKY’S LANGUAGE LEARNING DEVICE [B]:
DOMAIN CONFUSION. Although Chomsky refers to the language learning device ([B] in Figure 1) as
an “abstract device,” he nevertheless attempts to place this “mental device” (another term of his) within the
domain of human psychology. Chomsky describes the role of the language learning device this way, using
Saussurian terms (1964:26):


The learning model B is a device which constructs a theory G (i.e. a generative grammar of a
certain langue) as its output, on the basis of the primary linguistic data (e.g., specimens of
parole), as input. To perform this task, it utilizes its given faculté de langage, its innate
specification of certain heuristic procedures and certain built-in constraints on the character
of the task to be performed.


This is a clear case of what Yngve describes as domain confusion, that is, “the confusion between the logical
and the physical, between grammar and the structure of people” (Yngve 1996:110).

Note, just for example, how the utterance U consists of a sample of speech (a specimen of parole). Speech 
can be seen in light of the articulatory gestures of a speaker as well as in light of certain sound waves which 
result from those gestures. Either way, U belongs to the physical domain. It is either a series of articulatory 
gestures that we can describe in terms of neuroanatomical events, or it consists of sound waves which we 
can describe in terms of the acoustic properties of energy across various wavelengths varying over time. 
Unfortunately, in Chomsky’s terms, once the UG patch is added to the system, the primary linguistic data is 
also assumed to carry with it a grammatical structure, otherwise there can be no way for it to trigger the 
appropriate parameter settings in UG. This is where the domain confusion comes in. The only physical
domain structural description we can assign to the articulatory gestures of an utterance is in terms of human
anatomy and physiology. The only physical domain structural description we can assign to the resulting 
sound waves is an acoustic one: energy across a range of wavelengths varying over time. The primary 
linguistic data of the Chomskyan construct — well-formed sentences containing grammatical structure — 
is something abstract, something in the logical rather than physical domain. 

Chomsky has been known to purposely conflate physical and mental, most notably in Knowledge of 
Language (Chomsky 1986:7), where he refers over and over to a speaker’s “mind/brain.” In that book he 
states: 


Certain mysteries are now known to us: the mystery, in part, of the universe and the mystery of the 
mind have been in a sense revealed out of the heart of darkness. Mind and matter, mind and brain,
have converged; space, time, and motion are reconciled; man, consciousness, and the universe are 
reunited since the atom in a star is the same as the atom in man. (Chomsky 1986: xxii) 


In On Nature and Language (2002:68), he reinforces this view, which he describes thus: 


On the factual side, there is no longer any concept of body, or matter, or “the physical.” There is just 
the world, with its various aspects: mechanical, electromagnetic, organic, mental — categories that 
are not delimited in an a priori way, but are at most conveniences. In each of the shifting domains 
of constructive inquiry, one can try to develop intelligible explanatory theories, and to unify them, 
but no more than that. 


It is easy enough to say that the distinction between the domains of chemistry and biology as fields of study
is one created by human beings. It is another thing altogether to say that an individual’s subjective
perception of reality (mental experience) is indistinguishable from objective reality. Yet, this is Chomsky’s
position. It is not only his position. He claims it is the standard position of science. In the same place,
Chomsky makes this rather remarkable assertion about the nature of science, claiming that it goes back to
the 17th century:


The standard that inspired the modern scientific revolution was abandoned; the goal [now] is
intelligibility of theories, not of the world [emphasis added] – a considerable difference, which
may well bring into operation different faculties of mind, a topic some day for cognitive science,
perhaps. (Chomsky 2002:68)

In sum, Chomsky’s view of science has two key features that separate it from the view of most working 
scientists today. He accepts these two statements; they do not: (1) there is no difference between the physical 
domain and abstract domains, and (2) science is not concerned with understanding the world, but only 
with understanding its own theories.⁴

Some Chomskyans could avoid the domain confusion by responding that the grammatical structure is 
not “in” the sound waves, but that UG makes a person perceive grammatical structure in the sound waves. 
Carroll seems to be in this more pragmatic group. As she puts it, 

[C]omprehending speech is something which happens as a consequence of a successful parse of the
speech signal. Before one can successfully parse the L2, one must learn its grammatical properties.⁵
(Carroll 2001:9)

In other words, UG supposedly causes a person to perceive grammatical structure in the sound waves in order
to make the appropriate parameter settings when acquiring a language the person does not yet know.⁶ As we will
see below, however, there is evidence to the contrary. 


2. A PRACTICAL COUNTERARGUMENT TO CHOMSKY’S LANGUAGE LEARNING DEVICE [B]. 
Klein (1986) presented a convincing thought experiment that seems to serve as a counter to Chomsky’s 
theory of a language learning device with UG as well as to Carroll’s apparently more real-world variant of 
it. He laid out this thought experiment in very straightforward terms: 


Suppose you were locked in a room and were continually exposed to the sound of Chinese coming 
from a loudspeaker; however long the experiment continued, you would not end up speaking 
Chinese. What makes learning possible is the information received in parallel to the linguistic input
in the narrower sense (the sound waves [generated by speech]). (Klein 1986:44) 


Unless you disagree with the outcome Klein suggests (and believe that the person could end up speaking
Chinese under the conditions he describes), the thought experiment seems to show that primary linguistic
data (even if the construct were not domain confused) would be inadequate for language learning. 

In fact, I have spoken to some who so strongly believe in UG that they are convinced that a person 
would at least learn grammatical structure in Klein’s Chinese Room. A few years ago, I suggested the topic 
to a research team in a course I teach on applied linguistics research methods. One student from the group 
ended up writing her thesis on the topic (Xin 2012). With her research team in that course, she conducted an 


⁴ Chomsky’s writing is rife with such domain confusions. Just as another example, in Rules and Representations (1980:32-33), he
writes of the ability to acquire language as a “biological endowment” yet refers to the development of the body as an “analogy” for
the growth of language, as if “language growth” in an individual were not a matter of human biology, specifically, a matter of changes
in the human body. 

⁵ Carroll’s argument has a key hidden assumption: to understand speech, one must be able to grammatically parse it. (Her
assumption would seem to require that a native speaker of English should not be able to understand the non-native English speaker 
who says “Cheese mouse eat” or “Cat rat kill.”) Krashen, on the other hand, assumes that one can learn appropriate order of speech 
behaviors (e.g., “The mouse eats the cheese”) and the accompanying redundancy markers (e.g., “the”, “-s”) by hearing and 
understanding examples produced by native speakers. 

⁶ To beat a dead horse: this means that UG can set the appropriate parameters because it can recognize grammatical structure in
speech the first time we are exposed to it, and understanding that speech has nothing to do with this ability. 




experiment comparing the learning of 76 participants exposed to “primary linguistic data” only (speech and 
text) vs. 71 participants who were exposed to speech, text, and relevant parallel sensory input (a set of 
detailed storyboard illustrations). The stimulus lasted only several minutes and was in a language completely
unknown to any of the participants. Those in the text-and-speech-only group did not perform above chance 
levels on measurements of their learning in terms of understanding or in terms of their knowledge of target 
language structure. Those in the text-speech-and-storyboard group performed above chance on both
measures. Direct comparison of the results reconfirmed that the second group (those receiving parallel 
sensory input) performed measurably better than the first. In short, the results of the experiment involving 
147 participants confirmed Klein’s thought experiment in a setting with real people actually learning a small 
subset of a language. Klein’s Chinese Room thought experiment and the real-world experiment modeled 
on it show that the input for language learning is not primary linguistic data, but rather, as Klein claims, a 
combination of linguistic and nonlinguistic input that includes speech (and/or text) accompanied by aural, 
visual, tactile, olfactory, kinesthetic, and other sensory input. 


3. THE RELEVANCE OF UNDERSTANDING TO LEARNING. It seems strange, to put it mildly, to doubt 
that understanding has something to do with learning a language. Yet, even most who champion 
communicative approaches are among those who accept or claim to accept Chomsky’s view in Figure 1. 
Even those who focus on input-based teaching approaches (e.g., Asher (1977), Krashen (1985), and others)
have refused to discard Chomsky’s assumption of “separate mental devices.” Although they say SLA
requires comprehension, they still accept that it occurs through a separate cognitive function. For example, 
when Krashen (1982:32) explains the function of the affective filter in his theory, he uses a diagram that is 
a very closely adapted version of Chomsky’s (1964:26) original.⁷ Krashen substitutes new terminology, for
example, “input” for “primary linguistic data.” However, it is clear from various sources that Krashen intends 
“input” to refer to “language that contains structures that we have not yet acquired” (see, for example, 
Krashen 1982:21). He tries to maintain agreement with Chomsky’s view by defining input in this way, that 
is, by treating “context, our knowledge of the world, our extra-linguistic information” (p. 21) as something 
that occurs outside the process shown in Figure 2. 


[Diagram: Input passes through the Filter into Language Acquisition, producing the Acquired output.]


Figure 2. Krashen’s View of the Role of Input in Learning. 


Although he avoids a head-on confrontation with Chomskyan theory, Krashen creates two problems 
for himself. 

The first is that he still implicitly contradicts Chomsky’s view. His diagram (Figure 2) superficially 
seems to match up with Chomsky’s (Figure 1), with the addition of only the affective filter (“Filter” in Figure
2). However, Krashen has steadfastly maintained that other material is necessary for language learning in 


⁷ Krashen states in the version of Principles and Practice (Krashen 1982: “Introduction to the Internet Version,” preceding p. i) on
his web page (accessed 2015) that his position on the role of input, the affective filter, etc. has not changed. Comparison with 
Krashen (2003) and more recent work bears this out. 




order to make the input comprehensible (see, e.g., Krashen 1982:20-30, 1985: passim, 2003:4-14). He doesn't 
call this other material “input,” but does say that it must be present in order for new language to be 
comprehensible, and thus able to be acquired. According to Chomsky’s view, remember, “primary linguistic 
data” is sufficient and understanding occurs through a completely different mental process. Krashen’s first 
problem, therefore, is that his theory says it agrees with Chomsky’s theory when it really does not.⁸

The second problem arises largely because of the first. Because Krashen marginalizes “context, our 
knowledge of the world, our extra-linguistic information” (1982:21) within his theory, no matter how much 
he emphasizes it in practice, there is no way for him to show in formal terms how it relates to the cognitive 
functions of learning. The cognitive and perceptual elements that “make input comprehensible” are literally 
missing from the picture in Figure 2. 

The result is that Krashen, like others who claim to retain Chomsky’s basic formulation but try to tweak 
it through implicit riders, can say nothing specific about how comprehension is related to learning. Next, I
will show (very briefly) how this actually works out in practice.


4. LEARNING WHAT IS IN THE INPUT IS NOT THE SAME AS COMPREHENDING THE INPUT. 
Chomsky claims that learning is via an entirely different mental device from understanding; he thus
claims that only exposure to the appropriate input triggers UG, and thus learning.⁹ Krashen claims that
receiving comprehensible input triggers unconscious learning (“acquisition” in his terms). This seems to 
work. For example, images like those in Figures 3a and 3b can allow someone to learn that the object
pictured in Figure 3a is called “shki,” while the object in Figure 3b is called “psoshje.”





a. Ke shki. b. Ke psoshje. c. Kse ke? 


Figure 3. Comprehensible Input (a / b) and a Comprehension Question (c). 

The person will be able to answer if the statements “Ke shki.” and “Ke psoshje.” are later replaced with the 
‘What is it?’ question “Kse ke?” under each image. Specifically, the person will answer “Ke shki” for 3a and 
“Ke psoshje” for 3b and 3c. The ability to respond in this way demonstrates that the person comprehends 
the input. The problem for Krashen is that the same person who comprehends the text/speech in Figure 3b
will not be able to correctly respond to the same question in Figure 3c if the target language has scope of 


⁸ Given the hostile environment toward non-Chomskyans that existed in the US when Krashen was first formulating his theories,
it seems very likely that using this seemingly legalistic ploy of defining input so as to exclude part of what had to come in to the
learner’s cognitive system was a kind of self-defense.

⁹ Although he has many times used “learning” to refer to the process conventionally referred to as “learning a language,” Chomsky
has also many times denied that “learning” accurately describes that process. This denial clearly arose as a reaction to Gold’s
unlearnability proof, with Chomsky’s incorporation of UG into his theory. See, for example, Chomsky (1980:33; 1986:26, 150-151;
1999:49; 2000:23; 2002:64). His position is that “learning” as normally understood does not occur with regard to language, merely
parameter setting. 




reference similar to that of Chinese or Japanese, or for that matter, not even if it has scope of reference similar 
to that of French or Spanish. This is because in none of those languages is the object in Figure 3c referred 
to in the same way as the object in Figure 3b. In Chinese and Japanese, the object in Figure 3c is referred to 
by the same term as is a sofa but not a chair like the one in Figure 3b; in French and Spanish, a unique term
differentiates the object in Figure 3c from sofas and chairs like the one in 3b. In the language in question, the
appropriate response for the question in Figure 3c is “pamatpa,” not “psoshje.”¹⁰


5. IT REALLY IS ABOUT ASSOCIATIVE LEARNING. Chomsky staked out a territory in his well-known
attack on Skinner’s behaviorism (Chomsky 1959:passim) that repeatedly disclaimed any relevance of 
associative processes to language learning. He has maintained the same position over the ensuing decades. 
As he says, 


the data available are much too impoverished to determine [our knowledge of language] by any 
general procedure of induction, generalization, analogy, association, or whatever. There is good 
reason to believe that children learn language from positive evidence only... and they appear to know 
the facts [about language] without relevant experience in a wide array of complex cases (Chomsky 
1986:55) 


Chomsky is referring here to the so-called Poverty of Stimulus (Chomsky 1986:XX). However, 
remember that Chomsky assumes that input contains only “primary linguistic data.”¹¹ He excludes by
assumption the parallel sensory input shown to be necessary by Klein’s Chinese Room thought experiment 
and by Xin’s actual experiment that followed up on Klein. This is the reason I have said elsewhere that it is 
not the input itself that is impoverished, but Chomsky’s view of the input (Coleman 2007:120). Coleman 
(2007:120-27) shows that effective input for SLA must provide breadth and limits of association between 
communicative behaviors (e.g., speech) and meaning. 

Here I associate “meaning” — in regard to the cognitive events in a single individual — with what 
Yngve’s (1986:176) model describes in terms of “domain of control,” which he defines as a region of “the 
active procedures and those that stand ready or nearly ready to be triggered.”¹² The triggers that activate
these procedures could originate from external sources (Klein’s parallel sensory input) or internal sources 
(currently activated memories). Part of what is activated at any moment can be modeled in terms of 
orthoconcepts (Yngve 2006:passim, 2009:315-16; Yngve & Burazer 2007:409-10). Using the diagrammatic 
formalization of Yngve (1996:246-74), Coleman (2006:164-68) shows how the relevance of parallel sensory 
input to associative learning can be modelled in very precise terms. 

6. CONCLUDING REMARKS. Chomsky’s model assumes there are separate cognitive processes for 
language understanding and language learning. He further assumes that the inputs for the two systems are 
quite different — primary linguistic data for one, an utterance for the other. It is important to keep in mind 


¹⁰ In fact, the examples are in an artificial language invented for experimental purposes; however, the scope of reference for “psoshje”
vs. “pamatpa” is the same as that for “yizi” vs. “shafa” in Chinese. Thus, the appropriate answer to “Kse ke?” under a picture of a sofa
would also be “pamatpa.” Other complications arise in other languages: in French, for example, 3c would elicit “fauteuil”, while a 
picture of a sofa would elicit “canapé”. 

¹¹ Two mysteries arise. The first is how ungrammatical and incomplete sentences in the input can be identified by UG before the
language is learned (compare English “It is January” vs. Spanish “Es enero”). The second is how we can learn to speak in elliptical 
sentences if language learning does not in part depend on being exposed to them (e.g., “A: Where are the keys? B: On the table in the 
hall.”). 

¹² What Yngve calls the “domain of control” thus models the physical domain state of a person, providing a model that can be
employed to specify how the person’s understanding of input goes beyond factors external to that person. 




that he uses the terms “primary linguistic data” and “utterance” in very different ways; they are not simply 
different ways of referring to the same thing. Chomsky is explicit in differentiating between the two “mental 
devices,” and he has essentially stuck to the position he laid out in 1964 with regard to the relationship of 
language learning and language understanding; the introduction of UG did not change that. Many others, 
as I have shown, have taken the path of adopting his position. Klein’s Chinese Room thought experiment and 
a real experiment modeled on it both show that position is untenable, however, because primary linguistic 
data is inadequate for acquisition. The path is a dead end. 

Others, like Krashen and his followers, have claimed to accept the theory but have implicitly 
contradicted it. This has led to apparent successes in practice. However, the internal contradiction in their 
path leads to an inability to systematize improvements and to effectively control the design of language 
learning experiences, since it leaves a key element out of the theoretical formalization: associative learning. 

A third path exists. Taking this approach has yielded up the constructs of breadth of association and 
limits of association (Coleman 2007:121), which have been shown to improve the design of learning
materials. This path means following the methods of standard science, avoiding the unwarranted, 
discipline-specific assumption promoted by Chomsky — that language understanding and language 
learning are separate, unrelated processes.¹³ We should try looking at language learning in light of all that
is known about human learning in other spheres. Preliminary work cited above shows that there is a great 
deal of promise down the third path. 


¹³ It is worthy of note that Chomsky never attempted to justify the assumption of separate “mental devices.” His construct of Poverty
of Stimulus is an axiom of that assumption (see above). UG was not established by any evidence; rather, Chomsky justified UG
because Gold showed that Chomsky’s prior assumptions were provably false. By adding UG, Chomsky thought he was able to defend
his earlier assumption about the separateness of the “mental devices” for language learning and language understanding from Gold’s
proof. All this has followed from Chomsky having assumed from the very beginning that learning to speak and understand a language
does not involve associative learning. However, Klein’s Chinese Room thought experiment shows that it does. 




REFERENCES 


BEEBE, L. M. 1985. Input: Choosing the right stuff. In Input in Second Language Acquisition, ed. by S. M.
Gass & C. Madden, 404-14. Rowley, MA: Newbury House. 

CARROLL, S. E. 2001. Input and Evidence: The raw material of second language acquisition. Philadelphia:
John Benjamins. 

CHOMSKY, N. 1959. A review of B. F. Skinner’s Verbal Behavior. Language 35.1:26-58. Reprinted in 
Readings in the Psychology of Language (1967), ed. by L. A. Jakobovits & M.S. Miron, 142-43. 
Englewood, NJ: Prentice-Hall. 

---. 1964. Current Issues in Linguistic Theory. The Hague: Mouton. 

---. 1966. Cartesian Linguistics: A chapter in the history of rationalist thought. New York: Harper & Row.

---. 1968. Language and Mind. New York: Harcourt, Brace, & World. 

---. 1980. Rules and Representations. New York: Columbia University Press. 

---. 1986. Knowledge of Language: Its nature, origin, and use. New York: Praeger. 

---. 1999. On the nature, use, and acquisition of language. In Handbook of Child Language Acquisition,
ed. by W. C. Ritchie & T. K. Bhatia, 33-54. New York: Academic Press.

---. 2000. Linguistics and brain science. In Image, Language, Brain: Papers from the First Mind Articulation
Project Symposium, ed. by A. Marantz, Y. Miyashita, & W. O’Neil, 13-28.

---. 2002. On Nature and Language, ed. by A. Belletti & L. Rizzi. Cambridge, UK: Cambridge University
Press. 

COLEMAN, D. W. 2006. A formal integrated view of speech, gesture, gaze and its implications for 
learning. LACUS forum 32:163-172. 

---. 2007. Required variability in input. LACUS forum 33:119-31.

COOK, V. 1993. Linguistics and Second Language Acquisition. New York: St. Martin’s. 

EVANS, N. & S. C. LEVINSON. 2009. The myth of language universals: Language diversity and its 
importance for cognitive science. Behavioral and Brain Sciences 32:429-92. 

FLYNN, S., G. MARTOHARDJONO, & W. O’NEIL (Eds.). 1998. The Generative Study of Second
Language Acquisition. Mahwah, NJ: Lawrence Erlbaum. 

GASS, S. M. & L. SELINKER. 2001. Second Language Acquisition: An introductory course. Mahwah, NJ:
Lawrence Erlbaum. 

GOLD, E. M. 1967. Language identification in the limit. Information and Control 10:447-74.

KLEIN, W. 1986. Second Language Acquisition. New York: Cambridge University Press.

KRASHEN, S. D. 1982. Principles and Practice in Second Language Acquisition. Internet Edition accessed:
http://www.sdkrashen.com/content/books/principles_and_practice.pdf

---. 1985. The Input Hypothesis: Issues and Implications. New York: Longman.

---. 2003. Explorations in Language Acquisition and Use: The Taipei lectures. Portsmouth, NH: Heinemann.

SALEEMI, A. P. 1992. Universal Grammar and Language Learnability. New York: Cambridge University
Press. 

SHARWOOD SMITH, M. 1985. From input to intake: On argumentation in second language acquisition. In 
Input in Second Language Acquisition, ed. by S. M. Gass & C. Madden, 394-403. Rowley, MA: 
Newbury House. 




---. 1994. Second Language Learning: Theoretical foundations. New York: Longman.

SWAIN, M. 1985. Communicative competence: Some roles of comprehensible input and comprehensible output in its
development. In Input in Second Language Acquisition, ed. by S. M. Gass & C. Madden, 235-56. Rowley,
MA: Newbury House.

WHITE, L. 1989. Universal Grammar and Second Language Acquisition. Philadelphia: John Benjamins. 

XIN, Y. 2012. Exploring the Chinese Room: Parallel Sensory Input in Second Language Learning. MA thesis,
University of Toledo. Available on-line at the OhioLink ETD Center: 
http://rave.ohiolink.edu/etdc/view?acc_num=toledo1333762798. 

YNGVE, V. H. 1996. From Grammar to Science: New foundations for general linguistics. Philadelphia: John Benjamins.

---. 2006. Formalizing the observer in Hard-Science Linguistics. LACUS forum 32:267-76.

---. 2009. People, orthoconcepts, and dialog. LACUS forum 34:313-19.


This article was first published at lacus.weebly.com. 


LACUS Forum 41:4.24-43 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the 
Creative Commons Attribution-NonCommercial license version 3.0; see 
http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 





HUMAN AND MACHINE TRANSLATION QUALITY: 
DEFINABLE? ACHIEVABLE? DESIRABLE? 
ALAN K. MELBY 


Brigham Young University 


Based on the presidential lecture delivered on August 
10, 2012, at the occasion of LACUS XXXIX, York 
University, Toronto 


THE DEBATE OVER MACHINE TRANSLATION (MT) began as the first general-purpose electronic 
computers were being built in the late 1940s. Warren Weaver (1955[1949]), then director of the Natural 
Sciences Division of the Rockefeller Foundation, wrote a highly influential memo suggesting that the use of 
computers in translation should be explored. Research projects aimed at programming those early 
computers to translate were already springing up in the 1950s. In the first period of work in machine 
translation, the goal was Fully Automatic High-Quality Translation (FAHQT), a term attributed to
Bar-Hillel (1960), one of the first full-time researchers in machine translation.

During the Cold War, the desire to access research found in scientific journals published in Russia led to 
considerable government funding for machine translation, especially Russian-to-English, during the 1950s 
and 1960s, until the scathing ALPAC (1966) report was published. The ALPAC report claimed that the 
quality of machine translation was very low and recommended a termination of funding for it. 

Throughout the history of machine translation, quality has been constantly pursued yet has remained
ill-defined.

After a long period of relatively little work on machine translation following the publication of the 
ALPAC report, at least in the United States, development efforts revived in the 1980s, along with an 
alternative approach, namely, the use of computers as tools in the hands of human translators rather than as 
competition. Since then, a spectrum of human and machine involvement in translation has been in place, 
with the need for translation increasing because of factors such as increased global trade. Points on the 
spectrum from one extreme to another are sometimes labeled as follows: 


FAHQT: fully automatic high-quality machine translation with no human involvement; 

HAMT (Human-Assisted Machine Translation): MT plus post-editing and/or pre-editing; 

MAHT (Machine-Assisted Human Translation): human asks computer for help, as desired; 

HTLGI (Human Translation, unassisted by any technology, “As God Intended”). This extreme was
proposed tongue-in-cheek from the beginning in order to provide symmetry with the other 
extreme of machine translation unassisted by humans; few human translators today avoid all 
technology, typically using at least word processing, electronic dictionaries, email, and Internet 
search engines. 


HUMAN AND MACHINE TRANSLATION QUALITY 25 


In this paper, machine translation will be used to refer to fully automatic translation, regardless of its 
quality. The term raw machine translation is used to emphasize that no human help has yet been enlisted 
to correct errors in the output of the system. 

Machine translation is at the center of HAMT. An entire text is translated automatically, and then, 
typically, a human post-editor fixes mistakes in the raw machine translation, until sufficient quality is 
reached. 

Human translation is at the center of MAHT. The human translator has a variety of optional resources 
available, such as entire sentences from a database of segments of text and their human translation, 
automatically retrieved when they are identical or similar to the current sentence being translated. 
Terminology is automatically looked up. And, in some cases, a machine translation of the current sentence 
is displayed, but it can be ignored at the option of the translator if it is of insufficient quality to edit. 
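
The retrieval step described above, in which stored segments are offered when they are identical or similar to the current sentence, can be illustrated with a minimal sketch. It is only an illustration under stated assumptions: the memory contents, the function name lookup, and the 0.75 similarity threshold are hypothetical, and the character-level similarity measure from Python's standard difflib module stands in for whatever matching a real translation tool uses.

```python
from difflib import SequenceMatcher

# Hypothetical translation memory: (source segment, stored human translation).
TRANSLATION_MEMORY = [
    ("Press the power button.", "Appuyez sur le bouton d'alimentation."),
    ("Close the cover before use.", "Fermez le couvercle avant utilisation."),
]


def lookup(sentence, memory=TRANSLATION_MEMORY, threshold=0.75):
    """Return (score, source, translation) for the most similar stored
    segment, or None when no segment reaches the threshold.

    The threshold value is an assumption chosen for illustration only.
    """
    best = None
    for source, translation in memory:
        # ratio() is 1.0 for identical strings and falls toward 0.0
        # as the two strings share fewer characters in sequence.
        score = SequenceMatcher(None, sentence.lower(), source.lower()).ratio()
        if score >= threshold and (best is None or score > best[0]):
            best = (score, source, translation)
    return best


if __name__ == "__main__":
    for query in ("Press the power button.",
                  "Press the red power button.",
                  "Hello."):
        print(query, "->", lookup(query))
```

An identical sentence yields a score of 1.0, a near match is returned with a lower score so that the translator can judge whether it is worth editing, and a sentence with no sufficiently similar stored segment returns nothing, leaving the translator to work unaided.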

The question of quality comes up repeatedly in discussions of machine translation. But what is 
translation quality? Translators often reject machine translation because of its low quality. Machine 
translation researchers sometimes claim that the quality of human translation is not so high. Yet there is no 
widely accepted definition of translation quality that applies to both human and machine translation and is 
used to measure quality reliably.

One purpose of this paper is to contribute to the debate about the role of humans and machines in 
translation by proposing and applying a new definition of translation quality. 

The intended audience of the paper is quite broad, including: 

academics who study language and are called linguists, 
government language workers, who often translate and who are also called linguists, and 
professional translators and translation project managers. 

The structure of the rest of the paper is a sequence of three questions about translation quality and 
proposed answers to them. 


Definable? Is translation quality definable? Not everyone thinks so, but I do. My definition builds 
on a standardized framework for developing structured translation specifications, which will 
be explained. 

Achievable? Is quality achievable using machine translation or HAMT? This question will be 
answered in terms of the proposed definition of translation quality. 

Desirable? Is it desirable to attempt to build a machine translation system whose raw output meets 
or exceeds the quality of professional translators in the most demanding cases? Suppose it is 
desirable. Are there less challenging but still useful goals to be pursued while the ultimate 
machine translation system is under development? Again, this question will be answered in 
terms of the proposed definition by defining two translation-specific variations of the classic 
Turing Test. 


In the conclusion of this paper, I will make some predictions concerning machine translation in the 
year 2029, but more importantly, I will suggest some short-term action items for those who are willing to 
give the new definition of translation quality a try. 


1. DEFINABLE? Translation quality is notoriously difficult to define. Consider the following quote from a 
European Commission document about the cost of poor translation quality in a practical translation
environment:


Although, like quality in general, quality in translation is a somewhat elusive concept, poor quality 




translations are in some—though not all—cases rather easy to detect. At best, a poor or less 
fortunate translation makes the reader shake his head and smile at a poorly translated sentence, but 
errors in translation can also have serious legal, financial or political consequences. (European 
Commission 2012:13) 


For a survey of a number of mutually incompatible approaches to translation quality in the translation 
studies literature, I refer the reader to House (2001), even though I disagree with her definition of translation 
and her characterization and quick dismissal of Skopos theory, as proposed by Reiss and Vermeer. We will 
later return to Functionalism in translation studies as a descendant of Skopos theory. 


1.1. COVERT VS. OVERT TRANSLATION. An important distinction made by House is between covert 
and overt translation (2001:249-50, 2010:245-46). An overt translation, as the name implies, makes no 
attempt to hide the fact that it is a translation. According to House, in an overt translation the translator is 
asked to “give target culture members access to the original text and its cultural impact on source culture 
members”. In contrast, a covert translation is an attempt to create a document that corresponds to the 
source text while appearing to have been authored in the target language and culture. As House points out, 
“The result may be a very real distance from the original”. A commonly committed error encountered over 
the centuries in attempting to define translation is the tendency to privilege one type of translation, such as 
overt or covert, in the definition, as the only right way to translate. Is a quality translation overt or covert? 

For example, an overt translation might leave some culture-specific concepts untranslated, while a 
covert translation might make considerable adaptation. Although J.R.R. Tolkien’s Lord of the Rings is not
actually a translation (it is rather a literary “pseudo-translation”, i.e., it portrays itself as a translation of an
ancient text), Tolkien’s appendices (particularly Appendix F, section II, “On Translation”) explain how he
adapted names and concepts from his putative ancient source
in order to make the text approachable for a modern audience (i.e., making a covert translation). For
example, he “Englished” many place and character names (i.e., created familiar-sounding equivalents for
them), such as English Shire for Sûza and Sam as the equivalent of Ban, the short form of Banazir in the
fictional language of the Hobbits (Tolkien 1965[1955]:477-79). Even if The Lord of the Rings uses
translation only as a literary device, these notes ring true to the issues translators do face when encountering 
names or concepts that are outside the world knowledge of the target audience. Translators often face the 
dilemma of what to translate into familiar terms and what to translate in unfamiliar terms, because the 
concepts (rather than the words) involved may themselves actually be foreign. 

In the case of a real translation (albeit a subtle one between two varieties of English), the U.S. publisher 
Scholastic changed the name of the first Harry Potter book from Harry Potter and the Philosopher's Stone 
to Harry Potter and the Sorcerer's Stone for the U.S. release and translated other cultural terms into U.S. 
English (e.g., crumpet to English muffin and jumper into sweater), a very covert sort of translation in the 
sense that most readers would never know the changes had taken place. The U.S. editor, Arthur Levine, was 
quoted as saying: 


I wasn't trying to, quote, “Americanize” them. What I was trying to do is translate, which I think is 
different. I wanted to make sure that an American child reading the books would have the same 
literary experience that a British kid would have. A kid should be confused or challenged when the 
author wants the kid to be confused or challenged and not because of a difference of language. 
(quoted in Radosh 1999) 


An examination of various editions shows that it was not only American translators who made such 


HUMAN AND MACHINE TRANSLATION QUALITY 27 


adaptations: Jean-Francois Ménard, the French translator, for instance, invented names to try to make the 
text accessible for French readers, and thus “Snape became Rogue, Slytherin became Serpentard, and the 
British word Bagman became Verpay, from the acronym VRP, describing someone engaged in door-to- 
door sales” (Goldstein 2004, edited). 

Although the target audience for Harry Potter books seemed unfazed by these covert translations, not 
all readers were happy, leading to criticism, particularly of the American adaptations, as cultural imperialism 
or “dumbing down” for an American audience (see, e.g., Gleick 2000). Given the difference in expectations of 
the target audience (children reading “juvenile literature” with little or no interest in the theory of translation 
and culture) and the professional translator critics, for whom such concerns are vital, it is difficult to imagine 
any translation on the overt-covert continuum that would have satisfied all readers. 

At the other end, an overt translation might resort to an extensive apparatus of notes and other 
explanatory materials external to the translated text itself in order to render it accessible to readers. For 
example, the Chinese translation of James Joyce’s Ulysses was a very overt translation and employed 5,991 
footnotes (the greatest number of footnotes ever in a Chinese-language book) in an effort to explain the 
linguistic and cultural intricacies of Joyce’s text to Chinese readers (Murphy 1995), perhaps leading to 
questions about the boundary between “translation” and “analysis”. 

Some scholars argue that the choice between overt and covert is not just a pragmatic one, but also an 
ethical one. Lawrence Venuti (1998, 2009), for example, a well-known translation theorist, describes the 
overt-covert distinction as the tension between a foreignized and a domesticated translation and has 
criticized Functionalism on several points. He maintains that part of the duty of the translator is to make the 
foreignness of the text apparent to readers, even at the expense of readability and accessibility. Venuti sees 
this position as an antidote to absolutist positions concerning translation that seek to domesticate it. A 
description of these criticisms and responses to them is found in Hague, Melby, & Wang (2011). 


1.2. DEFINING TRANSLATION. To the above evidence of the difficulty of defining translation quality, I 
add an anecdote about Roy Harris, who later became a respected professor of general linguistics at the 
University of Oxford, as told by David Bellos (2011:3): supposedly, early in his career at Oxford, Harris was 
assigned to teach translation but got out of it by refusing to teach something the faculty board could not 
define. Not only is translation quality difficult to define, but translation itself is. 

Thus, I take a small detour to define translation. First, I will assume some building blocks, namely, 
source language and target language. One translates from the source language into the target language. 
Source text and target text are the respective documents one begins with and creates. Even this much is 
controversial, for Roy Harris disputes the notion of a language. Actually, there is much to dispute, and 
commercial translation companies recognize the need to distinguish among locales, which are geographic 
regions, such as the United States and the United Kingdom, together with their various conventions such as 
for dates (is “8/12” the 12th of August or the 8th of December?), times, and currency. English does not exist as 
a well-defined object; witness the many differences between American and British English. Nevertheless, I 
will move on and use source/target language and source/target text in the rest of this paper, without apology. 

A welcome point of agreement in the world of translation is that the word translation exhibits a basic 
ambiguity between the process of translating and the product (i.e., the result) of translating. However, I part 
ways with most definitions of translation because they use the transfer metaphor, as in defining translation 
as the transfer of meaning from a source text to a target text. Elsewhere, my colleagues and I have discussed 
at length the notion of meaning as it relates to translation (Melby, Manning & Klemetz 2005) and have 
shown, we believe, that meaning does not reside in a text but rather in the mind of a human. If we are right, 
then defining translation as the transfer of meaning from one text to another is a dead end. Meaning is 
something we create when we interpret or create a text, but it never leaves our mind and thus never gets 


28 ALAN K. MELBY 


embedded in a piece of paper or other medium for representing a text. Therefore, meaning cannot be 
transferred from a source text to a target text. The translator assigns a meaning to the source text, hopefully 
similar to the intended meaning of the author, and creates a target text to which a reader will hopefully 
assign a meaning that corresponds to the author’s intent. 

Roman Jakobson (2000[1959]) suggests a definition that avoids claiming that meaning resides in a text. 
He defines translation as the act of interpreting a verbal sign, and he distinguishes among three types of 
translation: intralingual translation (commonly called paraphrasing or rewording); interlingual translation 
(between two languages); and intersemiotic translation (between a human language and some nonverbal 
sign system). Today, translation is assumed to be interlingual translation unless clearly indicated otherwise. 
For Jakobson, translation is an act of interpretation that assigns meaning to a text rather than extracting 
meaning from a text. The directionality of Jakobson’s definition is thus the opposite of the directionality 
in traditional definitions of translation. (However, this argument is not one of unfettered relativism: the 
text itself certainly does have a profound influence on meaning, for the reader is always faced with a 
particular text rather than another. It would certainly stretch credibility, for example, to argue that a 
service manual for a 1964 Ford Mustang is really a treatise on Zen Buddhist meditation practices.) Neither 
Jakobson’s nor any other definition of translation can be fully theory-neutral. I have chosen a definition of 
translation that is compatible with Jakobson but goes beyond it by adding an element called specifications: 


Translation is the process of creating a target text that corresponds to a source text according to 
specifications. 


1.3. STRUCTURED TRANSLATION SPECIFICATIONS. Rather than depending on a definition of 
meaning, the above definition of translation depends on an elaboration of translation specifications to avoid 
becoming impossibly vague. Such an elaboration exists and is found in a document published by ISO, the 
International Organization for Standardization (www.iso.org), that provides guidance for translation 
projects: ISO/TS-11669 (2012). As editor of this document, I worked with delegations from many countries 
over a period of five years, finally arriving at a substantial degree of consensus on defining 21 translation 
parameters whose values for a particular translation become a customized set of structured translation 
specifications. These 21 parameters are divided into three main categories (product, process, and project) 
and are available to the public with descriptions of each (Parameters 2013). 


Product parameters include those that are used to describe the source text, such as its language and 
region, and those that are used to specify requirements for the translation, such as content 
correspondence, which may ask for either an overt or a covert translation. 

Process parameters are about the tasks to be performed, such as initial translation, revision (a 
bilingual task involving a comparison of the source and target texts to determine whether they 
correspond according to the product specifications), and review (involving a subject matter 
expert to determine whether the target text makes sense in a particular subject field, such as 
medicine or physics). 

Project parameters include requirements for the project environment, such as whether the work 
must be done in a secure facility, and the two project-relationship parameters between buyer 
and seller that everyone instinctively uses: cost and delivery deadline. 

There is nothing mysterious about any of these parameters, but they make the definition of translation 
at once precise and flexible. Not all quality translations are the same, but they do not vary chaotically. Each 
translation is defined relative to a set of structured specifications from the universal framework of 
parameters. 




Translations can be assessed according to just one category of parameters, such as product, in which 
case it doesn’t matter how long it takes to produce the target text or what steps were followed to arrive at it, 
or translations can be assessed according to two or all three categories, namely product, process, and project. 

The mention of assessment indicates that a definition of translation quality is nearby, but in order to 
properly lay the foundation for that definition, I must introduce three types of translation stakeholders and 
three perspectives on quality in general. 


1.4. TRANSLATION STAKEHOLDERS. Someone asks for a translation. Someone gets the translation 
done. And someone uses the translation. In ISO/TS-11669, these stakeholders are called the requester, the 
provider, and the end-user. Sometimes the requester and the end-user are the same person, such as when 
someone asks a colleague to translate a short passage while he or she waits, but typically the requester and 
the end users are different. 
The term client can be understood to be either the requester or the end user and therefore will be 
avoided for clarity and because it implies a commercial arrangement, yet translations can be 
requested and provided within an organization without any money changing hands. 


1.5. PERSPECTIVES ON QUALITY. There is a literature on quality in general that has been largely ignored 
in the world of translation. Garvin (1984) distinguishes three perspectives on quality that apply across many 
industries: transcendent, manufacturing, and user. 


The transcendent perspective assumes that there is an absolute ideal and that quality is measured 
by how close one comes to that ideal. In cooking, one might seek to bake the perfect apple 
pie. In music, an orchestra might strive toward the perfect performance of Beethoven’s Ninth 
Symphony. 

The manufacturing perspective is focused on compliance with specifications. Quality consists in 
meeting specifications, whatever they are. An electric motor can be assembled using quality 
components from any of several suppliers, and it will function equally well, since those 
components meet the specifications within extremely close tolerances. 

The user perspective is focused on the end-user experience. Quality is measured by how well a 
product or service meets the needs of the end user. In a sense the user perspective is about the 
adequacy of specifications. For example, suppose that a walking shoe has been manufactured 
according to the given specifications, but those specifications included a requirement that the 
front of the shoe extend an extra centimeter and point down 23 degrees. The shoe may look 
stylish (or strange) but the problem is that it is not good for walking. The extended toe will 
cause most people to trip unless they curl up their toes uncomfortably. The specifications were 
followed, but they were inappropriate. They were not “fit for purpose”, since the purpose of 
the shoes was to support comfortable walking. 


In translation, both the manufacturing and user perspectives apply. A quality translation must comply 
with the specifications, such as the structured translation specifications introduced above, and those 
specifications must be appropriate to end-user needs. However, there is a tendency to emphasize 
transcendent quality, which has two main components, which are often called accuracy and fluency. 
Accuracy refers to how well the source and target texts correspond. Fluency refers to how well the target 
text reads on its own as a document in the target language. A quality translation would be perfectly accurate 
and fully fluent. However, there are two problems with the transcendent perspective in translation. First, 
there is often a tension between accuracy and fluency. For example, an overt translation may be quite 
accurate but lack fluency because it has a foreign feel to it, while a covert translation may be quite fluent yet 
not be accurate in an absolute sense because elements of the source culture have been converted to 
approximate equivalents in the target culture. Secondly, the transcendent perspective ignores specifications, 
other than the goal of perfect accuracy and fluency. In particular, a strictly transcendent perspective 
pretends that there is a perfect translation that does not depend on audience or purpose. 


1.6. DEFINING TRANSLATION QUALITY. In June 2012, I gave an invited workshop on translation quality 
at the NATO translation office in Brussels. The morning was spent discussing structured translation 
specifications and perspectives on quality. In the afternoon, after a lunch break, one highly experienced 
translator expressed her frustration by declaring that the transcendent perspective is the only possible 
perspective. She is not alone. There is a deep-seated and praiseworthy drive toward transcendent quality in 
all things. Perhaps your grandmother told you that if something is worth doing, it is worth doing well. 
Nevertheless, I suggest that the manufacturing and user perspectives should not be ignored. How can this 
dilemma be resolved? At the NATO workshop, the dilemma was resolved by distinguishing between 
transcendent quality and functional quality. 

An element of resolving the tension between transcendent quality on the one hand and manufacturing 
and user quality on the other hand is the recognition that just as no definition of translation is theory 
neutral, no definition of translation quality is theory neutral. There are many competing theories of 
translation. See Biguenet & Shulte (1992) for a compilation of primary sources illustrating translation 
theories from the 17th to the 20th centuries. See Gentzler (2001) for a presentation of more recent translation 
theories from the 1970s, 80s, and 90s. And see Pym (2010) for an analysis of both traditional and modern 
theories. As the primary basis for my definition of translation, I have chosen Functionalism, as promoted by 
Christiane Nord and extended by adding structured translation specifications as found in ISO/TS-11669. 
Nord’s Functionalism is not to be confused with Functionalism in philosophy (see Functionalism 2013), 
which is a reductionist approach. Functionalism in translation is not reductionist. It acknowledges the 
variety of acceptable translations and types of translation along with the vast array of unacceptable 
translations. Functionalism in translation builds on the Skopos theory that developed in Germany during 
the 1980s. Also, during the 1980s, unaware of Skopos theory, I developed my own theory of translation, 
based on a decade of work as a member of an interactive machine translation project team, followed by a 
decade of development of tools for human translators. I published an initial exposition on translation 
theory at the end of the 1980s (Melby 1990) but then discovered Functionalism and decided to work within 
it rather than compete with it. 

Working within Functionalism means that we will assume that every translation has a purpose (that is, 
an intended function) and an intended audience, even if, in the extreme case, the audience is only the 
translator playing around with language and the purpose is only to have fun. Translation in a commercial 
environment is done for a requester with a purpose and an end-user profile in mind; otherwise, there would 
be no reason to pay for a translation. 

Neither Functionalism nor any other translation theory is accepted by everyone. Although it seems 
obvious to talk about the purpose of a translation, relative to an intended audience, it is a sufficiently 
significant shift from the past for it to be noted in the title of Nord’s 1997 book, Translation as a Purposeful 
Activity. 

As Ed Gentzler (1998) points out in his review of Nord (1997), this is a shift from source-text oriented 
theories, where the only choice is between a “faithful” translation and a “free” translation of the source text: 
Nord “breaks the chain of 2,000 years of theory revolving around the ‘faithful vs. free’ axis” and brings in 
target-culture issues. For one audience and purpose a fairly literal translation might be more effective. For 
another audience and purpose, a free translation that replaces some foreign items of source culture with 
reasonably equivalent target-culture items might be more effective. The question is no longer whether all 
translations should be faithful to the original or whether all translations should be free translations. What 
used to be the fundamental question of translation theory becomes a strange question to even ask. The new 
question is what are the purpose and audience of the translation and how do they affect the translation 
“brief”. I have replaced the translation brief with structured translation specifications. 


1.7. A FUNCTIONALIST DEFINITION OF TRANSLATION QUALITY. At this point, I am prepared to 
present a definition of functional translation quality: 


A quality translation demonstrates required accuracy and fluency for the audience and purpose 
and complies with all other negotiated specifications, taking into account end-user needs. 


So far as I am aware, as simple as it looks, this is a new definition that has not been presented by anyone 
else. Its power may be hard to recognize because of its flexibility and simplicity. 

This definition includes all three perspectives from the literature on quality. 

Transcendent quality, while not accepted as an absolute, is acknowledged by explicitly mentioning 
accuracy and fluency. Manufacturing quality is recognized as compliance with specifications, and user 
quality is brought in to remind us that specifications can be well-defined yet wrong for a particular 
audience. All three stakeholders are found implicitly in the definition. Requesters and providers negotiate 
the specifications, but those specifications must meet the needs of end-users. Structured specifications 
rescue the definition from undue vagueness. 

Various types of assessment are supported by the definition, including: 


product assessment, in which case only the product-oriented translation parameters are given 
values in the specifications, 

process assessment, which focuses on the process-oriented translation parameters, and 

project assessment, which focuses on project-oriented translation parameters. 


Suppose a highly-confidential German source text about a new type of light bulb is to be translated into 
Bulgarian and delivered to the client within three weeks, without divulging anything about it to the 
competition. Further suppose that the target text is translated according to the agreed-upon linguistic 
specifications but delivered a month late and is useless to the client. According to a strictly product-focused 
assessment, the translation could be very good, but according to a project-focused assessment, it would be 
a bad translation no matter how good the target text is linguistically, because the project violated an 
important relationship specification, the delivery date. Or suppose the translator asks a colleague for help 
and sends the confidential source text along with a question, but the colleague turns out to work for the 
competition, and the colleague passes on crucial information about the light bulb, and furthermore the 
competition takes advantage of the confidential information. According to a project-focused assessment, it 
would be a bad translation because the confidentiality requirement of the environment specifications was 
violated. 

Or suppose that the translation about the new kind of light bulb that was delivered to the client is 
eventually found to contain a serious error and the project manager wants to determine where the error was 
introduced. In this case, a process-focused assessment would be relevant. If records were kept of the target 
text after initial translation, after revision, after review, after final layout, and after proofreading, the project 
manager will be able to determine exactly where in the document production process the error was 
introduced and who missed it further along in the process, and corrective action can be taken. 

The proposed definition supports any combination of product, process, and project assessment, 
depending on which categories of translation parameters are given values in the structured specifications. 
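
The light-bulb scenario above can be restated as a small Python sketch. It is only an illustration under stated assumptions: the `Check` class and the `assess` function are hypothetical names introduced here, not part of ISO/TS-11669, and each specification is reduced to a simple pass/fail value. The point it shows is that the same translation can pass one assessment and fail another, depending on which categories of parameters are in scope.

```python
from dataclasses import dataclass


@dataclass
class Check:
    """One specification outcome (hypothetical structure, for illustration only)."""
    category: str  # "product", "process", or "project"
    name: str
    passed: bool


def assess(checks, categories):
    """Assess only the checks whose category is in scope.

    Returns (ok, failures), where failures lists the names of violated specifications.
    """
    relevant = [c for c in checks if c.category in categories]
    failures = [c.name for c in relevant if not c.passed]
    return (len(failures) == 0, failures)


# The light-bulb translation: linguistically fine, but delivered a month late.
light_bulb = [
    Check("product", "content correspondence", True),
    Check("project", "delivery deadline", False),
    Check("project", "confidentiality", True),
]

product_only = assess(light_bulb, {"product"})
product_and_project = assess(light_bulb, {"product", "project"})

print(product_only)         # (True, [])
print(product_and_project)  # (False, ['delivery deadline'])
```

A real set of structured specifications would of course carry richer values than a boolean for each parameter; the sketch only captures the dependence of the verdict on the categories assessed.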

In answer to the question of this section, translation quality turns out to be definable. 

Of course, it remains to be seen whether this definition is useful. 


2. ACHIEVABLE? Although much human translation is faulty, it is assumed in this paper that professional 
human translators can produce quality target texts, especially working into their native language on a text 
in a subject field they are familiar with. After mentioning some use-cases for machine translation and listing 
some highlights in the history of machine translation, a debate about its future will be summarized. Then 
the definition of translation quality developed in the previous section will be presented as a means to avoid 
endless back and forth about obstacles to machine translation and how they might be overcome. 

It is difficult for many to accept that a translation does not always need to be completely fluent and 
accurate in order to be a quality translation. For example, a system that translates instant messages in real- 
time during a chat session between two people writing in different languages might include a machine 
translation system that exhibits functional quality even though it sometimes produces output that is 
unintelligible. Suppose that the specifications require that the system produce translations of up to 140 
characters of source text within one second (this would be part of the value of ISO/TS-11669 Parameter 20, 
Deadline). If human translators take an average of ten seconds to translate an instant message (assuming any 
human translator would want to do this work), then they are not producing functional quality, even though 
their translations are more accurate and fluent than raw machine translation. However, the specifications 
could also disqualify the speedy machine translation system if the requirement for accuracy and fluency 
(which falls under Parameter 9, Content Correspondence) is that fewer than one out of ten translations is 
rejected by the user as unintelligible, yet two out of ten chat translations are rejected by end users. An 
important empirical question is whether users are able to accomplish their objective of communicating via 
chat even though they don’t both speak the same language. One factor that will be crucial is how often instant 
chat translations are accepted by an end user, that is, are reasonably fluent, yet cause serious 
misunderstandings because they are not accurate. If it turns out that interlocutors are able to detect fluent yet 
inaccurate translations and rephrase a question in order to solicit a response whose translation is more 
useful, then a fast machine translation system with less accuracy and fluency than a slower human translator 
would be seen as a quality solution while the human would not be seen as providing a quality solution. 
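
The chat scenario can be made concrete with a short Python sketch. The one-second deadline and the one-in-ten rejection threshold come from the example above, as do the human translator's ten seconds and the two-in-ten rejection rate of the first machine system; the other figures (the human rejection rate, the machine response time, and the second machine system) are assumptions made purely for illustration, and the names `ChatSpec`, `Provider`, and `meets_spec` are hypothetical. Comparing an average response time against the deadline is also a simplification.

```python
from dataclasses import dataclass


@dataclass
class ChatSpec:
    """Two of the specifications in the chat example."""
    max_seconds: float = 1.0          # deadline per message of up to 140 characters
    max_rejection_rate: float = 0.10  # fewer than one in ten rejected as unintelligible


@dataclass
class Provider:
    name: str
    avg_seconds: float     # average time to translate one message
    rejection_rate: float  # share of translations rejected by end users


def meets_spec(provider, spec):
    """Functional quality here means meeting both the deadline and the rejection limit."""
    on_time = provider.avg_seconds <= spec.max_seconds
    intelligible = provider.rejection_rate < spec.max_rejection_rate
    return on_time and intelligible


spec = ChatSpec()
human = Provider("human translator", avg_seconds=10.0, rejection_rate=0.01)  # rate assumed
fast_mt = Provider("raw MT, 2 in 10 rejected", avg_seconds=0.5, rejection_rate=0.20)
better_mt = Provider("raw MT, 1 in 20 rejected", avg_seconds=0.5, rejection_rate=0.05)  # assumed

results = {p.name: meets_spec(p, spec) for p in (human, fast_mt, better_mt)}
print(results)
```

Under these assumed figures, the human translator fails on the deadline, the first machine system fails on the rejection rate, and only the second machine system satisfies both specifications.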

Moving beyond this example of competition between humans and machines in a chat environment, 
consider environments where accuracy is paramount. There are a variety of translation specifications for 
projects involving health care materials (where inaccuracies can result in harm or death to a patient) and 
transcription and translation of court depositions (where inaccuracies can result in a mistrial). Here speed 
does not compensate for lack of accuracy, and thus these types of translation will have very different 
specifications than chat translation. 

Consider what types of specifications would be appropriate for fully-automatic machine translation of 
technical support database entries, human translation of advertising materials, summary translations of 
messages between suspected terrorists, published translations of annual corporate reports, and even literary 
translation. 

Some find it easier to accept a specification-based definition of quality, as opposed to a 
transcendent-only definition, by considering the fact that quality creates value. All three perspectives 
(transcendent, manufacturing, and user) work together to produce value to a requester or end-user. A product or service can 
have value without being perfect in all respects, but it must satisfy an acceptable set of specifications. 




People want things to become faster, cheaper, and better. Typically, vendors reply, “pick two of the three”. 
We are told we must compromise, and we usually accept this. However, this dictum, applied to translation, 
assumes a transcendent view of quality, which is problematical, as explained above. In the proposed 
definition of translation quality, speed (meeting the delivery deadline) and cost are part of quality. Realistic 
specifications require balancing speed, cost, and other aspects of quality. The question for translation then 
becomes whether a particular set of specifications is achievable and, if not, what compromises could be 
made to arrive at an achievable set of specifications. 

A yearning for perfection has been behind the dream of machine translation from the beginning. Of 
course, for some sets of specifications, translation quality has already been achieved. For example, the 
European Commission and the United Nations each produce millions of words of quality translation per 
year, in that they satisfy their own specifications. However, these translations are expensive and slow. One 
aspect of the question for this section, “Achievable?” is whether it will ever be feasible to build a machine 
translation system that produces the ultimate in FAHQT: fully automatic translation that meets 
requirements of accuracy and fluency tailored to the audience and purpose and that is as good as the work 
of the best professional human translators, only much faster and much less expensive? In other words, can 
we have it all: faster, cheaper, and better? 

In a sense, we have had it all with computers. For a number of years, computers have been getting faster, 
cheaper, and easier to work with. The question is whether we can have a similar advance in machine 
translation. Clearly, such an advance would be a dream-come-true for machine translation software 
developers and a nightmare-come-true for human translators. 

I will not be predicting an answer to this question; however, I will provide a little history of what others 
have said about it, and, in the next section I will comment on Kurzweil’s prediction about what he calls the 
Singularity and how it might impact translation and other aspects of society. 


2.1. HISTORY. The history of machine translation has been punctuated by periods of optimism and 
pessimism concerning this question. Peter Toma, one of the pioneers in commercial applications of 
machine translation, was involved in the early work in the 1950s and later founded Systran 
(http://www.systransoft.com/), a machine translation company that has been around for over forty years. 
Peter Toma was an optimist. He was not discouraged by the ALPAC report, and around 1975, in a face-to- 
face meeting, he told me that if someone would give him all the needed rules, he would program them and 
produce FAHQT. He was thus the epitome of the rule-based machine translation (RBMT) true believer. I 
was hopeful that I would be able to help find some of those elusive rules that Peter was looking for. However, 
as we will see, rule-based approaches to machine translation have been largely overtaken by statistical 
approaches. 

By 1980, I had gone through an intellectual crisis triggered by questioning my deepest beliefs about the 
nature of language. I switched from working on machine translation to working on productivity tools for 
human translators, with an emphasis on translator workstations, seeing the potential of microcomputers 
connected to remote computers as the home for translator tools, even before the introduction of the IBM 
PC in 1981. A detailed description of my intellectual crisis and revised views on the nature of language can be 
found in my book, The Possibility of Language (Melby 1995). 

In 1980, while I was working on tools for human translators, Martin Kay wrote an internal report for 
Xerox PARC (Palo Alto Research Center) that was not published until 1997 but had a wide influence 
through photocopies that were passed around among those involved in translation technology. See Kay 
(1997[1980]), where he proposed an MAHT approach and argued against the use of raw machine 
translation. 




Serge Perske led the machine translation effort in the European Commission. He held views 
diametrically opposed to those of Kay and me. In 1984, at Stanford University, during the Coling 
(computational linguistics) conference being held there, he approached me on a grassy campus quad and 
told me not to waste my time designing translator workstations, since within five years there would be no 
more human translators. A decade later, there were more human translators than ever, and Serge had lost 
his job. The massive rule-based machine translation project, Eurotra, was funded by the European 
Commission from 1978 to 1992. It never resulted in a commercially viable system that produced better 
results than Peter Toma’s linguistically simpler approach. 

In the early 1990s, a new approach to machine translation was initiated by a research project at IBM 
called Candide. Based on the assumption that hand crafting of thousands of rules to perform syntactic and 
semantic analysis of the source language, adjustment of the resulting representation during the transfer 
phase, and then generation of a target-language text was a dead end, they tried a statistical approach. In the 
words of Adam Berger, a member of the CANDIDE team: 


In speaking a French sentence F, a French speaker originally thought up a sentence E in English, but 
somewhere in the noisy channel between his brain and mouth, the sentence E got “corrupted” to its 
French translation F. The task of an MT system is to discover [...] the optimal English sentence, 
given the observed French sentence. This approach involves constructing a model of likely English 
sentences, and a model of how English sentences translate to French sentences. Both these tasks are 
accomplished automatically with the help of a large amount of bilingual text. As wacky as this 
perspective might sound, it’s no stranger than the view that an English sentence gets corrupted into 
an acoustic signal in passing from the person’s brain to his mouth, and this perspective is now 
essentially universal in automatic speech recognition. (Berger 2013) 
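
In symbols, the noisy-channel view that Berger describes is conventionally written with Bayes' rule. The rendering below is the standard textbook form, not a formula taken from Berger's text; the symbols E, F, and the hat notation are assumptions introduced here for illustration.

```latex
\hat{E} \;=\; \arg\max_{E} P(E \mid F) \;=\; \arg\max_{E} \; P(E)\, P(F \mid E)
```

Here P(E) corresponds to the "model of likely English sentences" and P(F | E) to the "model of how English sentences translate to French sentences"; the denominator P(F) is dropped because it does not depend on E.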


Although initially derided by the machine-translation community in the 1990s, within a decade, 
this approach, called Statistical Machine Translation (SMT), began to show some impressive 
results. Google Translate started out as a rule-based system, but in 2006 it was switched over to a 
statistical system (Gorman 2012). 


Strange as it sounds, most of those involved in SMT development are not theoretical linguists or
translators. In fact, in most SMT systems, there is no syntactic analysis at all. Everything is based on simple 
sequences of words in source and target language texts. According to noted MT researcher Hans Uszkoreit 
(pers. comm.), it is not an uncommon phenomenon for SMT researchers to lose track of what languages 
they are working on while adjusting their systems. 

Statistical machine translation systems are trained by feeding in a huge corpus of texts and their 
translations that have been pre-processed so that each segment of each source text is aligned with a segment 
of its target text that best corresponds to the translation of that source segment. From this corpus the system 
builds huge tables of correspondence showing how words will likely be translated when surrounded by 
particular words. I think of SMT as massive parallel plagiarism of bits and pieces from human translations. 
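
The idea of building correspondence tables from aligned segments can be illustrated with a deliberately tiny sketch. It is not how any production SMT system works (real systems estimate probabilities with word- and phrase-alignment models); it only counts raw co-occurrences. The corpus and the function names `build_correspondence_table` and `best_guess` are hypothetical, invented for this illustration.

```python
from collections import Counter, defaultdict

# Hypothetical toy corpus: each pair is one aligned (source, target) segment.
ALIGNED_SEGMENTS = [
    ("the house", "la maison"),
    ("the car", "la voiture"),
    ("a house", "une maison"),
]


def build_correspondence_table(segments):
    """Count how often each source word co-occurs with each target word."""
    table = defaultdict(Counter)
    for source, target in segments:
        for s_word in source.split():
            for t_word in target.split():
                table[s_word][t_word] += 1
    return table


def best_guess(table, source_word):
    """Return the target word seen most often alongside source_word, or None."""
    if source_word not in table:
        return None
    return table[source_word].most_common(1)[0][0]


table = build_correspondence_table(ALIGNED_SEGMENTS)
print(best_guess(table, "house"))  # the word most often seen next to "house"
```

Even this crude table shows the point made above: nothing in it encodes syntax or meaning, and a word never seen in the training segments (for example "dog") simply has no entry.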

Also, in 2006, the keynote presentation of a major machine translation conference was a public debate 
between a proponent of SMT, Daniel Marcu, and myself (a token linguist), about obstacles facing SMT, 
labelled as “data-driven machine translation” for the debate, and how to overcome them.’ One of the slides 
features the following quote from a major figure in SMT: 


' An annotated set of slides from that debate can be found at: http://www.ttt.org/amta. 




Within the next few years there will be an explosion in translation technologies, says Alex Waibel, 
director of the International Centre for Advanced Communication Technology, in response to the 
question of how far machine translation systems can be taken, “There is no reason why they should 
not become as good, if not better, than humans,” Dr Waibel says. 


Daniel Marcu, my debate partner and a true believer in statistical machine translation, did not see the quote 
as farfetched. 

I suggested that several obstacles needed to be overcome by SMT in order for it to produce raw output 
equivalent to professional human translation. These obstacles included the need for machines to 
demonstrate the same competencies expected of a professional human translator, such as: 


Ability to understand source text, 

Ability to write in target language, and 

Ability to adjust to audience and purpose when translating and evaluating whether source and 
target texts correspond 


Daniel responded by noting that “airplanes don’t bat their wings, but they still fly.” In other words, there 
is no reason to believe that machines need to solve the problem of translation the same way humans do. He 
brought up the Chinese Room thought experiment’ devised by John Searle, and suggested that for all we 
know, statistical machine translation systems may already understand what they are translating. As I have 
debated the question of machine understanding with other proponents of SMT, I have realized that this line 
of argument about whether machines understand language is fruitless. Until there is general agreement on 
what constitutes understanding, there will be little progress in the debate as to whether machine translation 
systems understand language. 

Another obstacle I presented was the need to take into account all types of context. In an article about 
context in translation (Melby & Foster 2010), I identify five types of context, and in the debate, I pointed out
that SMT then dealt only with co-text (the words immediately surrounding a word) and bi-text (links
between segments of source text and segments of target text). I suggested that machine translation systems 
will eventually have to deal with long-distance context where keyword lookup is insufficient and context 
that goes beyond what is described in text. Daniel responded that given time and resources, machine 
translation will be able to reach beyond co-text and that there is no information needed by machine 
translation that cannot be found in texts. Thus, the debate about context led nowhere in 2006, except for an 
esoteric yet potentially significant question about “non-text” that can be examined by consulting the 
appendices to the Melby and Foster (2010) paper. 

Yet another obstacle I presented is that in order to produce human-like translations, machines will need 
to demonstrate second-order creativity. Much has been written about orders of creativity. See, for example, 
Ekvall (1997). For the purposes of the debate, I defined first- and second-order creativity as follows: 


First-order creativity involves algorithmically generating an infinite number of items from a finite 
system; second-order creativity involves creating elements outside that infinite result. 


Anyone familiar with Generative Grammar, as defined by Noam Chomsky and his many colleagues, will 
recognize that I am tying first-order creativity to what Chomsky has referred to as the creative aspect of 


* See http://plato.stanford.edu/entries/chinese-room/ 




language, namely, that a simple phrase-structure grammar that can be written down in a few lines can 
generate an infinite number of sentences using recursive relative clauses. As applied to translation, an 
instance of second-order creativity would be either (a) finding a solution to a translation problem when 
that solution is not found in the corpus of bilingual texts that was used to train the system or (b) recognizing 
that none of the solutions in the corpus are appropriate. In other words, detecting that something is amiss 
in all the suggested translations and finding a solution not found in the material used for training the 
machine translation system would constitute second-order creativity. Daniel’s response was that human 
translators make mistakes, so why should machines not be allowed to make mistakes? 
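
First-order creativity in the sense defined above can be made concrete with a two-rule grammar. The sketch below uses a hypothetical toy grammar (the rules, vocabulary, and function names are assumptions made for illustration, not drawn from the debate): because the noun-phrase rule refers to itself through a relative clause, every nesting depth yields a new sentence, so a finite system generates an unbounded set.

```python
# Hypothetical toy grammar (illustrative only):
#   S  -> NP "slept"
#   NP -> "the cat" | "the dog that chased" NP


def noun_phrase(depth):
    """Expand NP, applying the recursive relative-clause rule `depth` times."""
    if depth == 0:
        return "the cat"
    return "the dog that chased " + noun_phrase(depth - 1)


def sentence(depth):
    """Expand S using a noun phrase with `depth` nested relative clauses."""
    return noun_phrase(depth) + " slept"


for d in range(3):
    print(sentence(d))
```

Every string this produces lies inside the set defined by the two rules. Second-order creativity, as defined above, would mean producing or recognizing the need for something outside that set, which no amount of further recursion can do.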

The debate was inconclusive (an exit poll revealed that those in the audience who went into the debate 
believing in SMT as the ultimate solution still believed in it at the end of the debate and the skeptics 
remained skeptical). However, the rate of improvement in SMT output has slowed in recent years, and 
although I recently saw Daniel Marcu at a translation-related conference and suggested another debate, no
date has yet been set for one.

As of 2014, the verdict is still out on machine translation. There seems to be a growing consensus that 
statistical methods alone are not sufficient to achieve human-like translation, and a number of researchers 
are trying various hybrid techniques, combining statistical and rule-based approaches. Some high-profile 
commentators, such as Nicholas Ostler (2010), argue that it is inevitable that soon “everyone on the planet
will be able to communicate using machine translation”. The problem with statements like this is that they
neglect to discuss the quality of machine translation.

After careful consideration, I have concluded that it is not very useful to pile up obstacles to progress in 
machine translation. A more useful debate would be how to determine when a translation is a quality 
translation. The definitions of translation and translation quality given in the previous section could be 
usefully debated. The definitions must be shown to be both reasonable and consistent with the real-world 
experience of experts. 

When a group of people within the translation community (including machine translation developers, 
translators, translation project managers, translation requesters and users, and academics) have 
reached consensus on definitions of translation and translation quality (hopefully similar to those presented 
in the previous section), it should become apparent that the question of whether quality machine translation 
is feasible is not a good question. It implies a yes or no answer. A much better question would be: how does 
one conduct a translation quality assessment? 

The translation being assessed may be a raw machine translation, a human translation produced by one 
professional translator, or the result of a process that includes both machine translation and human 
translation, with several people involved and several types of technology in addition to machine translation. 

The first question to ask is: where are the specifications? Assessment is a non-starter unless you have the 
specifications for the translation. If the specifications were not made explicit before the production phase 
began, then they must be inferred after the fact. 

The second question to ask is: what is being assessed? Sometimes the product is assessed in isolation. 
Then, all the assessor has is the source text, the specifications, and the target text. Sometimes, aspects of the 
project will be assessed, such as whether specified resources were consulted, whether confidentiality was 
kept as required, and whether the product was delivered on time. And sometimes the process followed will 
be part of the assessment, especially when a product-oriented assessment reveals errors and the source of 
those errors is being tracked down. Project and process assessment are fairly straightforward, relative to 
product assessment. 

When product assessment is involved, there are two well-known approaches: holistic and analytic. A 
holistic assessment, in the extreme, starts with the three essentials, that is, the source text, the target text, 
and the specifications, and the assessor grades the translation as a whole, relative to the specifications, along
a scale such as good, bad, or mediocre. The problem with holistic assessment is that it is hard to know what
to fix in order to improve. Actually, there is something even less useful than a holistic assessment as just 
described. That would be a holistic assessment without any explicit specifications. In that case, the grader 
uses whatever subjective criteria come to mind, without even knowing for sure why the translation was 
done and for what audience. 

An analytic assessment, in the extreme, starts with the same three essentials but uses a much more 
detailed metric, which typically includes error categories that are selected and weighted. Errors can be 
marked right down to the word level. The combination of errors and their severity results in a total number 
that represents how far the translation is from a quality translation, that is, one that meets the specifications, 
assuming the specifications are end-user appropriate. (Note that there are many possible metrics, each 
associated with a set of specifications. An approach to analytical assessment that is totally incompatible with 
the definition of translation quality in this paper is to attempt to define one metric that applies to all 
translations.) 
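
As a rough illustration of how an analytic metric turns marked errors into a single number, consider the sketch below. The error categories, weights, and severity multipliers are hypothetical values chosen for the example; they are not taken from any published metric, and a real metric would derive them from the specifications.

```python
# Hypothetical error categories and weights for ONE set of specifications.
WEIGHTS = {"terminology": 2.0, "accuracy": 3.0, "fluency": 1.0}

# Hypothetical severity multipliers.
SEVERITY = {"minor": 1, "major": 5}


def penalty(errors, weights=WEIGHTS, severity=SEVERITY):
    """Sum weight * severity over all errors marked by a grader.

    `errors` is a list of (category, severity_label) pairs.
    A result of 0 means no errors were marked against the specifications.
    """
    return sum(weights[cat] * severity[sev] for cat, sev in errors)


marked = [("terminology", "minor"), ("accuracy", "major"), ("fluency", "minor")]
print(penalty(marked))

# A different set of specifications might de-emphasize fluency entirely.
gisting_weights = {"terminology": 2.0, "accuracy": 3.0, "fluency": 0.0}
print(penalty(marked, weights=gisting_weights))
```

The same marked errors yield different totals under the two weightings, which is the sense in which each metric is tied to one set of specifications rather than applying to all translations.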

Other approaches to assessment are being explored, such as evaluation of pre-selected items (Kockaert 
& Segers, forthcoming), while various assessment methods used only for machine translation involve 
automatic comparison between raw machine translation and one or more “gold standard” translations of 
the source text, typically prepared by a panel of human translators, to see how similar the machine output 
is to the human translation. The machine translation system is tweaked and the output is compared with 
the gold standard. This is done over and over to see if modifications to the machine translation system 
improve or degrade the output. While such measures are useful, they are problematic, because they depend 
on the reference translations. Preliminary research done at the German Research Center for Artificial
Intelligence (DFKI) has shown that when a translation’s score depends on the set of references, changing
the references used can change the score significantly without the translation under evaluation itself
changing (Arle Lommel, pers. comm.). Nevertheless,
these methods show that MT quality, if defined on the basis of similarity to human reference translations, 
still falls short, even in the best of circumstances (see Figure 1). 
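
The dependence on reference translations can be seen even with a very crude similarity measure. The sketch below is not one of the automatic metrics actually used in machine translation research and is not the DFKI procedure; it is a hypothetical word-overlap score, with invented example sentences, meant only to show that the same candidate can score very differently against two equally acceptable references.

```python
def unigram_precision(candidate, references):
    """Fraction of candidate words that appear in at least one reference."""
    cand_words = candidate.lower().split()
    if not cand_words:
        return 0.0
    ref_words = set()
    for ref in references:
        ref_words.update(ref.lower().split())
    return sum(w in ref_words for w in cand_words) / len(cand_words)


candidate = "the meeting was postponed"

# Two hypothetical reference sets; both could be acceptable human translations.
refs_a = ["the meeting was postponed"]
refs_b = ["they put off the session"]

print(unigram_precision(candidate, refs_a))
print(unigram_precision(candidate, refs_b))
```

The candidate does not change between the two calls; only the references do.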

One project, called QT Launchpad (http://www.qt21.eu/launchpad/), has taken the definition of
translation quality in this paper as a starting point and has developed a system for building customized
metrics, called Multidimensional Quality Metrics (MQM), that are all tied to a product-oriented subset of
the translation parameters in ISO/TS-11669 (see Lommel et al. 2014). It will be informative to see whether 
the metrics resulting from the QT Launchpad project can be shown to be valid and reliable in the sense 
these terms are used in assessment theory and practice. 

A reliable metric is one that gives the same or very nearly the same result when used independently by 
two or more graders. In other words, it does not matter who grades a translation. The score will be about 
the same. 
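
A minimal sketch of a reliability check follows, assuming that each grader assigns a numeric score to the same set of translations. The scores, the tolerance, and the function name `is_reliable` are hypothetical; assessment practice normally relies on established inter-rater statistics rather than the simple mean gap used here.

```python
def is_reliable(scores_a, scores_b, tolerance):
    """True if two graders' scores for the same translations differ,
    on average, by no more than `tolerance`."""
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("need two equal-length, non-empty score lists")
    gaps = [abs(a - b) for a, b in zip(scores_a, scores_b)]
    return sum(gaps) / len(gaps) <= tolerance


# Hypothetical scores given independently to the same four translations.
grader_1 = [18, 4, 0, 11]
grader_2 = [17, 5, 0, 12]
grader_3 = [2, 20, 9, 1]

print(is_reliable(grader_1, grader_2, tolerance=2))
print(is_reliable(grader_1, grader_3, tolerance=2))
```

Graders 1 and 2 differ by less than one point on average, so under this toy criterion it would not matter which of them graded a translation; graders 1 and 3 disagree too widely for the metric to be called reliable.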

A valid metric is one that gives results that match the intuition of experts. One way to determine 
whether a translation metric is valid would be to give a team of translators who are recognized experts both 
the three essentials (source text, target text, and specifications) and the score assigned to the translation by a
grader. Validation could begin with some clear-cut cases of good translations and bad translations. If the
metric assigns the good translations a good score and the bad translations a bad score, then the metric is at 
least somewhat valid. Less dramatically different translations could then be assessed to see if changes that 
the experts agree are improvements, relative to the specifications, result in a better score, using the metric. 

Clearly, a metric must be both valid and reliable in order to be useful. However, surprisingly, little has 
been done to study metrics in this way. 

Now I can answer the question asked at the beginning of this section. Is quality machine translation 
achievable? Yes, sometimes. It depends on structured translation specifications. For some specifications,
quality machine translation was achieved long ago. One well-known example is the Météo system
(Hutchins & Somers 1992:207-20) that was developed at the University of Montreal in the 1970s.

The specifications for the Météo system were to translate a particular type of Canadian weather bulletin
from English to French well enough that only a word here or there needed to be corrected by a post-editor. 
All the bulletins were written by meteorologists who went through the same training, and they were all in 
the same style. Météo was a spectacular success, in that most of the sentences of raw machine output were 
indistinguishable from what would have been produced by a professional translator, but it is hard to find 
additional sublanguages that are naturally occurring and have a continuous high volume of source texts that 
need to be translated. Nevertheless, Météo is an example of an environment in which quality machine 
translation has been achieved. 

There are, of course, many examples where, for a given machine translation system and a given set of 
specifications, quality raw machine translation is not currently feasible. Combining humans and machines 
in HAMT or MAHT configurations brings many more possibilities. 

Thus, the real question of this section is not whether quality translation involving machines
is achievable but when it is achievable, always relative to explicit specifications.


3. DESIRABLE? The previous section attempted to demonstrate that quality translation involving machines 
is definitely achievable for some specifications. This is becoming less and less of an issue. A question that is 
still an issue is whether it is desirable to attempt to build a machine translation system that achieves FAHQT, 
that is, one that produces quality raw output no matter what the specifications. Although there is current government
funding for machine translation development, it is for particular specifications, whether they are explicit or 
not. However, some academic researchers in machine translation are still aiming for FAHQT. Is this a waste 
of time and effort? For a number of years after my departure from machine translation development and 
my entry into translator tool development, I was uncomfortable with attempts to achieve FAHQT and 
actively discussed obstacles to it, implying that if researchers didn’t know how to overcome those obstacles 
they should give up and pursue more promising research. 

Recently, I have radically changed my position on the desirability of trying to achieve FAHQT. I now 
not only encourage work toward it but have devised a variation on the classic Turing Test that will help us 
objectively determine whether it has been achieved. 

In the classic Turing Test (Turing 1950) a computer attempts to imitate a human in a conversation. The 
human judge is randomly connected to either a human or a computer for each test run. The two parties chat 
using instant messaging of some kind. In the 1950s this was accomplished using Teletype machines. At the 
end of each conversation, the human indicates whether the interlocutor was a human or a computer. If 
humans guess wrong more than 30 percent of the time, then the computer is deemed to possess human 
intelligence. 

In my proposed variation, called the Translation Turing Test (TTT), a computer tries to imitate a 
professional translation project manager in a commercial environment. The human judge is a real 
customer, a requester of translation services. The requester develops an initial request for proposals (RFP), 
using structured translation specifications as in this paper. Then the test proceeds like the classic Turing 
Test in that the customer is randomly connected to either a human or a computer. However, in the 
Translation Turing Test, the conversation is about the project specifications, rather than being a general 
discussion between strangers who may have little in common. The project manager and the requester 
interact about the specifications in a back-and-forth conversation initiated by the requester, clarifying and 
completing the specifications as needed, until the requester and the project manager have reached 
consensus concerning the specifications. Then, the conversation is put on hold until the agreed-upon due
date, during which time the translation takes place. If the project manager is a human, s/he finds a
qualified human translator who gets the job done using any desired technology. If the project manager is a computer,
then absolutely no human involvement is allowed. The translation is then delivered to the customer, along 
with anything else listed in the specifications. Once again, the conversation is put on hold to give the 
customer a chance to contact a qualified translation assessment expert, who evaluates the translation 
according to the specifications. The customer then guesses whether the project was completed using 
humans or FAHQT. If the customer guesses wrong more than 30 percent of the time, the machine 
translation system is judged to have passed the Translation Turing Test. 

Obviously, this is a very hard test to pass. Not only must the computer be capable of engaging in an 
intelligent conversation about translation specifications, which probably means it could pass the classic 
Turing Test (this is an interesting empirical research question), but the machine translation system must 
be capable of translating between any pair of languages in any subject field for any audience and purpose 
and satisfy all other elements of the specifications. Probably only academic researchers would attempt to 
pass the Translation Turing Test, yet it is clearly an interesting philosophical question whether a machine 
could be built that would do so. 

What about those not interested in working on the Translation Turing Test? Are there less challenging 
but still useful goals to be pursued while the ultimate machine translation system is under development? 
Yes, definitely. Again, the definition of translation quality in this paper helps define those goals. 

I have devised another alternative to the Turing Test that brings together these goals. At a lunch meeting 
in Berlin with some colleagues, I became convinced that the second variation of the Turing Test strays too 
far from the classic test to be called a Turing Test, so I have named it the Translation Technology Test, to 
avoid any danger of posthumous discomfort for Alan Turing. 

There are a multitude of Translation Technology Tests. Thus, we can distinguish between the Translation
Turing Test (the TTT) and a Translation Technology Test (a TTT) by the choice of English article.
Another solution would have to be found in a language that does not use definite and indefinite articles 
similar to those in English. 

Each Translation Technology Test is specific to one set of specifications that allow for a range of similar 
source texts and one translation environment, which may be anything from raw machine translation with 
no post-editing to human translation done by a human translator who uses a translation tool and whose 
competencies match the specifications. A translation quality metric is chosen that is appropriate for the 
specifications and was derived from a framework of metrics that has been shown to be valid. There is no 
project manager and thus no conversation between the customer and the project manager. A variety of source 
texts are submitted, and the translations are graded by translation quality assessment professionals. The test 
is not allowed to proceed unless there is a reasonable assurance that the grading is reliable. A threshold is 
established for grades according to how close professional human translators come to satisfying the 
specifications. If at least 70 percent of the translations meet the threshold (that is, if they fail no more than 30
percent of the time), then that translation environment is judged as having passed one Translation Technology
Test, the one associated with those specifications. 
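
The pass criterion for a single Translation Technology Test can be sketched as follows. The grading scale (higher is better), the example grades, and the function name `passes_technology_test` are assumptions made for illustration; only the 70 percent requirement comes from the description above.

```python
def passes_technology_test(grades, threshold, required_percent=70):
    """True if at least `required_percent` of the graded translations
    meet the threshold (assumes a higher grade is closer to the specifications)."""
    if not grades:
        raise ValueError("no graded translations")
    met = sum(1 for g in grades if g >= threshold)
    # Integer arithmetic avoids floating-point surprises at exactly 70 percent.
    return met * 100 >= required_percent * len(grades)


# Hypothetical grades for ten source texts in one translation environment.
grades = [90, 85, 70, 95, 60, 88, 92, 75, 81, 55]

print(passes_technology_test(grades, threshold=70))
print(passes_technology_test(grades, threshold=85))
```

The same output passes or fails depending on where the threshold sits, which is why the threshold has to be established from how close professional human translators come to satisfying those particular specifications.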

It must be kept in mind that the threshold of meeting the specifications is not set according to whether 
the translation meets transcendent requirements of full accuracy and fluency. The threshold is set 
according to whether the translation meets the specifications, which may, for example, de-emphasize 
fluency. This implies that the metric must be built according to the specifications, which is consistent with 
the principle that there can be no universal translation metric. 

TTTs apply to the entire spectrum of translation types described at the beginning of this paper (fully
automatic, to HAMT, to MAHT). They probably also apply to human translation where no technology is used
other than a pencil and paper, although this type of translation is no longer competitive with MAHT in the
commercial market. Its only place is in a translator certification examination, and even there, some
translator certification systems are moving toward keyboarded examinations on a computer.

In some ways, Translation Technology Tests have been around for a long time. As soon as people started 
backing off from seeking one metric to assess all kinds of translation environments and started believing that 
machine translation can be useful even if it is not fully accurate and fluent, they essentially started moving 
toward Translation Technology Tests (TTTs). What I am suggesting is that a formal and repeatable 
methodology for TTTs should be developed and used extensively throughout the world of industry, 
including government translation offices, and even in academia for translator education. 


CONCLUSION. As described in section 2 (Achievable?), predictions of the imminent replacement of all 
human translators by machines have come and gone, often with a promise of significant progress within five 
years. Ray Kurzweil, a futurist, has been writing about the upcoming Singularity for years. The Singularity is
the point in time when human intelligence is given to a computer, probably an embodied computer.
Kurzweil’s 2005 book The Singularity is Near predicts that the Singularity will take place by the year 2029.
In a June 2011 interview, Nataly Kelly (2011) confirmed with Ray Kurzweil that the
2029 Singularity includes the capability to translate as well as or better than the best human translators. In 
other words, we will have FAHQT and a machine will pass the Translation Turing Test by 2029, according 
to Kurzweil. 

I have read The Singularity is Near, and I have come away from it convinced that at least Kurzweil has a
feeling for how difficult the task of replicating human intelligence will be. I do find what I think is a major
flaw in Kurzweil’s logic. He predicts that once we have built true artificially intelligent entities, they will be 
able to share knowledge much more quickly than humans can, essentially instantly. I strongly disagree. 
Kurzweil acknowledges that Singularity intelligences would be able to reconfigure themselves and develop 
new conceptual categories at the most basic level of fundamental world view, just like humans, at least
open-minded humans. Instant information sharing within a predetermined, shared domain of knowledge may
be possible, but each Singularity intelligence will quickly evolve as it learns and integrates perceptions into 
unique mental categories, just as each human intelligence is unique. Singularity intelligences, just like 
languages and cultures, would evolve along their own path and would probably have to use language, with all 
its inefficiency and fundamental ambiguity, to communicate among themselves. 

As pointed out by a colleague, Robert Orr, after the 2012 oral version of this paper, what sets humans
(and, I would add, Singularity intelligences) apart from current statistical machine translation systems is
their ability to organize a collection of instances of language, such as idioms, into a coherent semantic
system without a pre-existing framework. Orr cites Makkai (2009), which proposes a two-stage
development of idioms in child language acquisition, as an example of the distinction between what 
statistical machine translation can currently do and what humans can do. 

Ultimately, machine translation systems will need to organize language information into the same strata 
used by the human mind. See Lamb (1999) for a detailed exposition of these strata from multiple 
perspectives (linguistic, cognitive, and neurological). 

In addition to the ability to organize information the way humans do, Singularity intelligences would 
possess free will, as do humans, and may choose other activities than translation. Thus, it is an open question 
whether Singularity intelligences would replace human translators. Kurzweil himself suggests that the 
human translation profession would evolve but not disappear. 

Since Kurzweil has been bold enough to make a prediction about the future of machine translation, I 
will also make some predictions: 


The acceptance of my definition of translation quality and the use of structured specifications in
particular will gradually increase and will help all stakeholders communicate better and both
assess and improve human and machine translation quality through human-machine
collaboration, not competition.

During the rest of the decade, that is, between now and 2020, the only humans who will be replaced
by MT will be those who fail to use technology effectively.


If Kurzweil is right and the Singularity arrives by 2029, the news that machines can translate as well as 
professional human translators will be lost in the worldwide commotion brought about by the realization 
that computers really are smarter than humans. This will impact every aspect of human life, since every 
interaction by text or voice could be with a human or a machine. Almost everyone on the planet will be in 
danger of losing their job to a robot, assuming the raw materials and energy needed for them to build more 
of themselves will be available. I personally look forward to this adventure and hope to be around in 2029 to 
see what happens. 

The bottom line is that human translators should stop worrying about being replaced by a computer
and instead make friends with translation technology.

Now, in practical conclusion, I present five serious action items for five types of players in the translation
industry:


for requesters and providers: use structured translation specifications from provider selection to
production to assessment of the results;

for translation tool developers: continue to focus on human-machine collaboration and incorporate
translation metrics into tools;

for machine translation developers: embrace the idea of Translation Technology Tests;

for AI researchers: aim at passing the general TTT (but beware of autistic MT! Some
explanation of this comment is required for further discussion, but that is for another paper);
and

for end users: ask for the specifications that were used during translation and make use of them in
assessment and in providing feedback.


REFERENCES 


ALPAC. 1966. Languages and machines: computers in translation and linguistics. A report by the
Automatic Language Processing Advisory Committee, Division of Behavioral Sciences, National
Academy of Sciences, National Research Council. Washington DC: National Academy.

BELLOS, DAVID. 2011. Is that a fish in your ear?: Translation and the meaning of everything. London: 
Faber & Faber.

BERGER, ADAM. 2013. Statistical machine translation. http://www.cs.cmu.edu/~aberger/mt.html 
(accessed August 2013). 

BIGUENET, JOHN & RAINER SCHULTE (eds.) 1992. Theories of translation: An anthology of essays from 
Dryden to Derrida. Chicago: University of Chicago Press. 

EUROPEAN COMMISSION. 2012. Quantifying quality costs and the cost of poor quality in translation. EC 
document ID: HC-31-12-463-EN-C.
http://bookshop.europa.eu/en/quantifying-quality-costs-and-the-cost-of-poor-quality-in-translation-pbHC3112463/
(accessed August 2013).




EKVALL, GORAN. 1997. Organizational conditions and levels of creativity. Creativity and innovation 
management 6(4):193-258. (DOI: 10.1111/1467-8691.00070). 

GENTZLER, EDWIN. 2001. Contemporary translation theories. Bristol: Multilingual Matters. 

—. 1998. Review of Translation as a purposeful activity: Functionalist approaches explained, by Christiane 
Nord. Translation and literature 7(2):266-76. 

GLEICK, PETER H. 2000. Harry Potter, minus a certain flavour. Op-ed, New York Times, July 23. 

GOLDSTEIN, STEVEN. 2004. Translating Harry, part I: The language of magic. 
http://bytelevel.com/global/translating_harry_potter.html (accessed June 2014). 

GORMAN, MICHAEL. 2012. Google gives us some insight on the inner workings of Google Translate. 
http://www.engadget.com/2012/03/19/google-translate-how-it-works/ (accessed July 2013).

HAGUE, DARYL, ALAN MELBY, & WANG ZHENG. 2011. Surveying translation quality assessment: A
specification approach. The Interpreter and Translator Trainer 5(2):243-67. 

HOUSE, JULIANE. 2001. Translation quality assessment: Linguistic description versus social evaluation. 
Meta: Translators’ journal 46(2):243-57. http://id.erudit.org/iderudit/003141ar (accessed August 
2013). 

—. 2010. Overt and covert translation. In Handbook of translation studies, vol 1, ed. by Yves Gambier & 
Luc van Doorslaer, 245-46. Amsterdam: Benjamins. 

HUTCHINS, JOHN & HAROLD L. SOMERS. 1992. An introduction to machine translation. London:
Academic Press. (Relevant chapter is available online at http://www.hutchinsweb.me.uk/IntroMT-12.pdf)

ISO. 2012. ISO/TS-11669:2012, Translation projects -- General guidance. Geneva, Switzerland:
International Organization for Standardization.

JAKOBSON, ROMAN. 2000[1959]. On linguistic aspects of translation. In The translation studies reader, 
ed. by Lawrence Venuti, 113-18. London: Routledge. 

KAY, MARTIN. 1997[1980]. The proper place of men and machines in translation. Machine translation
12:3-23.

KELLY, NATALY. 2011. Interview with Ray Kurzweil.
http://www.huffingtonpost.com/nataly-kelly/ray-kurzweil-on-translati_b_875745.html (accessed August 2013).

KOEHN, PHILIPP. 2010. Statistical machine translation. Cambridge: Cambridge University Press. 

KOCKAERT, HENDRIK J. & WINIBERT SEGERS. (forthcoming). Evaluation de la traduction : la méthode 
PIE (Preselected Items Evaluation). To appear in Turjuman: revue de traduction et d’interprétation / 
journal of translation studies. 

KURZWEIL, RAYMOND. 2006. The Singularity is near: When humans transcend biology.
New York: Viking.

LAMB, SYDNEY. 1999. Pathways of the brain: The neurocognitive basis of language.
Amsterdam: John Benjamins. 

LOMMEL, ARLE, ALJOSCHA BURCHARDT, MAJA POPOVIC, KIM HARRIS, ELEFTHERIOS AVRAMIDIS 
& HANS USZKOREIT. 2014. Using a new analytic measure for the annotation and analysis of MT
errors on real data. Proceedings of EAMT 2014. 

MAKKAI, ADAM. 2009. On redefining the idiom. LACUS forum 36:215-27. 

MELBY, ALAN. 1990. The mentions of equivalence in translation. Meta: Translators’ journal 35(1):207-13. 
—. 1995. The possibility of language. Amsterdam: John Benjamins. 

MELBY, ALAN, ALAN MANNING, & LETICIA KLEMETZ. 2005. Quality in translation: A lesson for the study
of meaning. Linguistics and the human sciences 1(3):403-46. 




MELBY, ALAN & CHRISTOPHER FOSTER. 2010. Context in translation: Definition, access, and
teamwork. Translation & Interpreting 2(2):1-15.

MURPHY, CAIT. 1995. “Ulysses” in Chinese: The story of an elderly pair of translators and their unusual 
bestseller. The Atlantic, September. http://www.theatlantic.com/past/docs/issues/95sep/ulyss.htm 
(accessed June 2014). 

NORD, CHRISTIANE. 1997. Translation as a purposeful activity. Manchester: St Jerome Publishing. 

OSTLER, NICHOLAS. 2010. The last lingua franca. New York: Walker & Company.

PARAMETERS. 2013. Structured translation parameters, version 6. http://www.ttt.org/specs (accessed 
August 2013). 

PYM, ANTHONY. 2010. Exploring translation theories. London: Routledge. 

QT LAUNCHPAD. http://www.qt21.eu/launchpad/. 

RADOSH, DANIEL. 1999. Why American kids don't consider Harry Potter an insufferable prig.
http://www.radosh.net/writing/potter.html (accessed June 2014, originally published in The New 
Yorker, Sept. 20, 1999). 

TOLKIEN, J.R.R. 1965[1955]. The return of the king. New York: Ballantine.

TURING, ALAN. 1950. Computing machinery and intelligence. Mind 59(236):433-60. 

VENUTI, LAWRENCE. 1998. The scandals of translation: Towards an ethics of difference. London: 
Routledge.

—. 2009. Translation, empiricism, ethics. Paper presented at the 2009 MLA Conference, Philadelphia, 
December 28. 

WEAVER, WARREN. 1955[1949]. Translation. In Machine translation of languages: Fourteen essays, ed. 
by William N. Locke & A. Donald Booth, 15-23. Cambridge MA: Technology Press of the 
Massachusetts Institute of Technology. 


This article was first published at lacus.weebly.com. 


LACUS Forum 41:4.44-54 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.
Copyright © 2020 the Linguistic Association of Canada and the United States. 


FORM VERSUS CONTENT AND INPUT VERSUS OUTPUT: 
THE DUAL BIFURCATION AND INTERDEPENDENT 
SYNERGY OF CONSTRUING AND SAYING 

H. STEPHEN STRAIGHT 
Binghamton University, State University of New York 


Abstract: Building upon Saussure’s distinction between signifiant (‘signifier’) and signifié (‘signified’), 
linguists recognize an undeniable bifurcation between form (overt appearance) and content (conveyed 
meaning) in language. With few exceptions these two aspects of language require separate description and 
analysis. On the other hand, despite the input-output bifurcation captured by Saussure's distinction between
langue and parole, most theories of language assume that the objects of linguistic investigation are the same
for listening as for speaking. This assumption persists despite the large and growing body of empirical and 
theoretical evidence in favor of a thoroughgoing comprehension-production bifurcation of language. The 
present Presidential Address lays out a dialectical, synergistic theory of language that recognizes both the 
form-content bifurcation and the input-output bifurcation of language to arrive at the RIFE Model of 
Language: Receiving versus Interpreting (the form-versus-content components of Construing, or input) and
Formulating versus Executing (the content-versus-form components of Saying, or output).


Keywords: Construing versus Saying, form versus content, duality of patterning, input versus output, 
listening versus speaking, comprehension versus production, the RIFE Model of Language,
dialectical-processual linguistics.


MOST PAST AND PRESENT THEORIES OF HOW LANGUAGE WORKS make the supposedly 
parsimonious assumption that the objects of linguistic study (phonetic features, sounds, semantic 
categories, words, morphological classifications, phrases, syntactic constructions, clause types, sentences, 
pragmatic functions, paragraphs, discourses, genres, etc.) are the same for listening as for speaking. (If we
substitute visuo-motor specifications for the auditory-articulatory specifications that characterize
listening and speaking, linguistic objects are typically much the same for reading as for writing, though a
number of aspects of linguistic forms and sometimes meanings do differ across the two modes of language.)
Language users, most theorists say, acquire and employ structural knowledge (whether in the form of rules,
interconnected nodes, lexical items and their phrasal properties, or other input-output-neutral
representation) governing the overt perceptible or measurable physical character and inferable intended
or ascribable meaning of linguistic events, including in a unitary analysis their properties both as acts of 
sensory perception and motor production (form) and as acts of verbal construction and cognitive 
interpretation (content). Adopting, explicitly or implicitly, Noam Chomsky's famous (1965) distinction 
between competence and performance, most linguists treat the many dissociations or interactions, and 
imbalances or incongruences, between receptive and expressive processing as byproducts of the vagaries of 
performance or of the language-acquisition process. Competence is process-neutral; performance is
process-disrupted.

This Presidential Address offers a host of examples of both the separateness and the interdependence 
of myriad aspects of the Construing-Saying divide in language knowledge, behavior, and acquisition, and in 
so doing lays out a “dialectical-processual” approach to all aspects of language. 


1. INPUT AND OUTPUT ARE JUST AS DISTINCT AS FORM AND CONTENT. All theories of how 
language works have recognized an undeniable bifurcation between form and content. Apart from
interesting exceptions such as onomatopoeia and other iconic properties of words, intonations, and
constructions, the auditory or articulatory features that characterize the sounds of a language and the linear 
and other features that characterize larger linguistic elements exhibit no fixed one-to-one relation to the 
semantic features that characterize the meanings of those various constituents. The form-content pairings 
that characterize the fundamental elements of a language, at all levels of description, are largely arbitrary 
and culturally transmitted, while at the same time, the composition and combinations of elements found in 
the separate domains of form and content exhibit non-arbitrary regularities of patterning of great intricacy
and importance. Languages have systems of sounds and systems of meaning, although the arbitrary pairings 
of elements in these systems, once they have been established, exhibit considerable power. No one seriously 
challenges the bifurcation of form and content in language. 

On the other hand, most theories of language assume that the objects of linguistic investigation 
(phonetic features, sounds, semantic categories, words, etc.—see above) are the same for listening as for 
speaking. Dismissing the well-known dissociation between receptive and expressive processing revealed 
respectively in Wernicke’s (posterior) and Broca’s (anterior) aphasic brain damage, and willfully denying 
or misrepresenting the receptive-expressive dichotomy based explicitly on this clinical neuroanatomical 
evidence in Ferdinand de Saussure's famous (1916) distinction between langue and parole, most linguists
have posited the existence of a processing-neutral, or fully bidirectional, representation of language. 
Ancients and moderns, structuralists and generativists, formalists and functionalists, all hold that 
underlying sets of linguistic elements, governed by a set of generative rules, combinatory constraints, 
admissibility conditions, sequential dependencies, optimality conditions, or other constructive principles 
constitute the cognitive-psychological basis for language ability, regardless of whether that ability is viewed 
from the perspective of the receiver or the producer of linguistic events. Language knowledge is assumed to 
be immune to the contrasting demands and tendencies of audition versus articulation, interpretive flexibility 
and versatility versus formulative habits and idiosyncrasy, perceptual distinctiveness in recognition and 
parsing of input versus effort-reducing and fluency-increasing (but ambiguity-introducing) simplification
in construction and pronunciation, and numerous other striking contrasts across the receptive-expressive 
divide, many of which I will identify in what follows in this address. 

Ironically, perhaps the most persuasive reasons to describe receptive and expressive processing 
separately derive not from their constant competition and resulting perpetual undermining of each other’s 
stability due to conflicts and differences but rather from their cooperative engagement. Construing typically
involves the use of the receiver's expressive system as the basis for moment-by-moment anticipation of input 
and input-based linguistic inference, while Saying typically involves the use of the interpretive system as an 
ongoing monitor to gauge the accuracy and comprehensibility of output. For example, the listener who 
confidently corrects another's utterance thereby demonstrates that the utterance was fully interpretable but 
that the listener's own expressive processes would realize that same meaning in a different way. (The 
longstanding use of grammaticality judgments as a basis for deciding what should and should not be 
generated by a grammar thus guarantees that the grammar will exhibit a bias toward language as
output-production over input-perception, leaving the latter dangling in theoretical and descriptive limbo.) This
and other examples of the ubiquitous intermingling of the processes of Construing and Saying in language
use and acquisition make it necessary to include in the account of language knowledge a constant, powerful,
and irresolvable dialectical synergy between reception and expression. 


2. LINGUISTICS HAS BEEN ON A WILD GOOSE CHASE FOR SEVERAL GENERATIONS. Linguists 
typically aim to capture their insights into the workings of language in a “grammar”, a theoretical construct 
which I dubbed “the Myth of G” fifteen years ago in a LACUS presentation (Straight 1999). As a set of rules 
that specify the members of an infinite set of linguistic objects, grammar has proven inadequate (or, even 
worse, an ever more mysterious black box, most recently defined in metatheoretical and increasingly less 
process-implementable “minimalist” terms) as an explanation of how human beings express their ideas in
words or understand the words of others. In other places and at other times I have laid out the ways in 
which “grammar” fails as anything other than a mathematical-logical account of language knowledge (see
Straight 1993), as worthless as a means for Construing or Saying as the Pythagorean theorem would be as a 
means for recognizing or drawing right triangles. This is not to deny that the Pythagorean theorem
(x² + y² = z²) captures “truths” about triangles (in two-dimensional Euclidean space, itself a theoretical construct
long recognized as a “myth” inferior to its mythic successor, four-dimensional Einsteinian space). Similarly, 
a grammar (to varying degrees) captures things that are “true” of language (when seen as strings of labeled 
and bracketed symbols in the one-dimensional space defined in the Chomsky hierarchy of formal 
grammars). In brief, linguistics, the scientific study of language, has, from its beginnings, largely accepted
the Myth of G and ignored the input-output dichotomy between processes of language comprehension and
processes of language production. 

For my present, limited expository purposes let me cut to the chase: Figure 1—which I will refer to 
repeatedly as I proceed—spells out much if not all of what this LACUS Presidential Address has to offer as 
a way out of the dead end in which many self-styled contributors to “the scientific study of language” 
currently reside, whether they know it or not. 

The remainder of this address will outline why and how to reject our field’s longstanding focus on 
language as a set of abstract objects defined (or “generated”, in the set-theoretic mathematical sense) by an 
even more abstract set of principles (or “rules”). In place of such abstruse theorizing, linguists will do well to 
adopt a view of language as a four-fold set of neuropsychological receptive and expressive processes whereby
people: 


(1) recognize auditory or visual inputs as linguistic events (= Receiving),
(2) ascribe meaning to these recognized forms (= Interpreting),
(3) construct meaningful forms of their own (= Formulating), and
(4) realize these constructions as vocal or manual outputs (= Executing).


These four processual components derive from two dichotomies: Form versus Content and Input versus 
Output. 


[Figure 1 diagram not reproduced. Labels recoverable from the original: Saying; Construing; Thinking;
Pragmo-Semantic Intention; Semanto-Pragmatic Construal; Formulating; Interpreting; Lexico-Syntactic
Representation; Phono-Lexical Representation; Premonitoring; Executing; Receiving; Motor Output;
Linguistic Event; Sensory Input.]

Figure 1. The RIFE Model of Language Processes and Their Interaction


2.1. THE FORM-CONTENT DICHOTOMY IN LANGUAGE. In 1943, the year I was born, the Danish 
linguist Louis Hjelmslev identified two “planes” of language, the expression (or form) plane and the content 
plane (Hjelmslev 1943). The French linguist André Martinet referred to essentially this same distinction 
when he divided language into meaningless building blocks (i.e., phonemes) and meaningful chunks (i.e., 
morphemes) and dubbed this the “double articulation” of language (Martinet 1949), and the American 
linguist Charles Hockett captured it in the phrase “duality of patterning” (Hockett 1960). Although this 
dichotomy sometimes breaks down in the face of onomatopoeia and other phonetic, morphological, and 
syntactic iconicity, in which a given sound, morphological structure, or syntactic construction bears 
synesthetic or other similarity to the meaning it evokes, this form-versus-content disjunction is recognized
in all current linguistic frameworks I know of. Within each of these domains (form and content) we find 
regularities of patterning (e.g. phonotactic and taxonomic) of great intricacy and importance; many 
individual linguists spend their entire scientific career exploring just one of these two “planes”. However, 
once a particular form-content relation has been established it tends strongly to constrain the establishment 
of new relations between a given overt element and other meanings, because of the ambiguity that results 
from such double connections; this constraint greatly limits the “arbitrariness” of the form-content relations 
in a given language. These principles of course apply fully to the RIFE model (Fig. 1), in which the interface 
between the two planes lies along the horizontal line between the external forms of language (in Receiving 
and Executing) and their internal content (Interpreting and Formulating). (Note that for the language user 
the only consciously perceptible component in this formidable array is the “Phono-Lexical Representation”,
whether by means of Receiving or Premonitoring; the rest are imperceptible, however confidently they may
be inferred. Arguments about what someone said or did not say in the end devolve to what we can remember 
or reproduce regarding this immediately perceptible product of the processes of Receiving.) 


2.2. THE INPUT-OUTPUT DICHOTOMY IN LANGUAGE (CONSTRUING VERSUS SAYING). 
Considering their universal recognition of the form-content dichotomy, it is surprising that linguists 
typically see the obvious input-output dichotomy not as an inherent property of language but rather as a 
product of the vagaries of language acquisition and use. The arguments appear to go as follows: 


• Yes, learners acquire receptive ability before expressive ability, but whether they are listening or
speaking the “underlying” elements of language are (of course) shared between listening and
speaking.

• Yes, dissociations and asymmetries between comprehension and production persist throughout a
language user's life, but these are due to factors peripheral to the internal “grammar” upon which
all language use depends.


The ready dismissal of the input-output dichotomy by linguists contrasts starkly with its wide recognition 
by (other) cognitive psychologists. Consider the following examples of this dichotomy in a number of areas 
of human psychology: 


• Recognition Memory versus Recall Memory
• Perception versus Action (cf. Knowledge versus Intention)
• Cognition versus Conation (Intellect versus Will)
• Posterior (temporal, parietal, occipital) versus Anterior (frontal) Cortical Lobes
• Illusion and Hallucination versus Inhibition and Compulsion


Within the domain of language, the input-output bifurcation takes many forms: 


• Monitoring and Pre-monitoring versus Anticipation and Rumination
• Automatic-Divergent “Spreading Activation” versus Reflexive-Convergent “Selection”
• Dialectal Receptive Multiplicity and Versatility versus Expressive Specificity and Idiosyncrasy
• Auditory Confusability versus Articulatory Slips
• Inference versus Implication
• Lexical Ambiguity versus Lexical Selection
• Grasping Metaphor versus Employing Metaphor
• L2>L1 Translation Ability Superior to L1>L2 Translation Ability


However, for all their separateness, input and output are interdependent!


• Construing typically requires the receiver to use the expressive system to anticipate what's coming and
to grasp the input’s implications.

• Saying typically requires the transmitter to use the interpretive system as a gauge of accuracy and
comprehensibility.

• Saying, without such monitoring, favors simplicity over complexity, while Construing, even when
accompanied by intense active listening, favors clarity over ambiguity.




2.3. CONCLUSION: THE INEVITABLE RECOGNITION OF THE DUAL BIFURCATION OF LANGUAGE.
Linguists will in time succumb to the force of undeniable psycholinguistic evidence,
recognize the constant, powerful, and irresolvable separateness (and the dialectical interdependence and
synergy) of the processes of Construing and the processes of Saying, and reject the Myth of G.


3. EXAMPLES OF EVIDENCE FOR AND NEGLECT OF THE INPUT-OUTPUT, RECEPTIVE-EXPRESSIVE
BIFURCATION AND THE INTERDEPENDENT SYNERGY BETWEEN CONSTRUING AND SAYING


3.1. EXAMPLE 1: FIRST LANGUAGE ACQUISITION


SOURCES:
• STRAIGHT 1980a: Auditory versus articulatory phonological processes and their development in
children
• STRAIGHT 1980b: Cognitive development and communicative interaction as determinants of the
emerging language abilities of children


“The view presented here ... challenges the long-standing and widely held assumption that there is
some of phonological representation that is common to perception and production. ... [T]hat
assumption is wrong. ... Analysis occurs by analysis and synthesis by synthesis: That is my thesis in
a nutshell ...” (1980a:43-44).

“Even with totally prelinguistic infants and very young babies [to say nothing of older children], it
seems that adult output [addressed to children] is constantly shaped into a form ... consonant with
the listener’s [inferred] receptive abilities .... This ... suggests that the most important
developments in child language, from the standpoint of interaction, occur in the area of language
comprehension rather than language production” (1980b:151).

Despite the above, “[t]erms such as performance, language knowledge, language learning, linguistic
skills, verbal sophistication, language stage, and acquisition of formal devices are used without
qualification to summarize findings based strictly upon language production. It is as if
comprehension were ... so simple, or so strongly linked to production, as not to be worth
mentioning” (1980b:151-52).


3.2. EXAMPLE 2: ACCOUNTING FOR STRUCTURAL COMMONALITIES BETWEEN RECEPTIVE AND
EXPRESSIVE PROCESSES


SOURCE:
• STRAIGHT 1980c: Structural commonalities between comprehension and production


“It is not necessary ... to posit any processually neutral, ‘structural’ principles that
language users must know in order to keep their two sets of language processes
[receptive and expressive] in line with each other. Such nonprocessual principles may
have value as heuristics and as descriptive tools ..., but the need for them as
psychological constructs can be obviated by including monitory and anticipatory
interactions in our account of language processes. These interactions result in a rough,
gradual, and always incomplete mutual adaptation between comprehension and
production processes” (316).


3.3. EXAMPLE 3: THE CREATIVE ASPECT OF LANGUAGE


SOURCE:
• STRAIGHT 1982: The formulation-interpretation circuit: A linguistic motor for the creation of
meaning


Is human consciousness
Entirely superfluous, or is it
The useful tool of what it does not know?
Quentin Vest, 1979¹


Words are the eyes of the imagination before they are its voice.
Marvita Belle Kreag Eley, 1988²


“[T]here are many reasons to believe that the processes of language formulation yield output forms that are 
nearly always interpretable in many ways other than what would be necessary to capture exactly the idea that 
gave rise to them. This tension between the ideational sources of a form as output and the ideational effects of 
that ‘same’ form as input is a powerful source of human creativity, both individual and social” (126). 


3.4. EXAMPLE 4: A PRELIMINARY SUMMING UP


SOURCE:
• STRAIGHT 1986b: The importance and irreducibility of the comprehension/production dialectic


Structural linguistics (which includes all linguistics of the past century that has embraced the Myth of G) is
to dialectical-processual linguistics (the RIFE Model of Language) as Ptolemaic astronomy is to Copernican
astronomy: “The dialectical-processual view of language bears much the same relation to the dominant
structural view of language that Copernican (heliocentric) astronomy bore to Ptolemaic (geocentric)
astronomy ... . The dialectical alternative [to structural theory] is ... just as contrary to common sense as
the Sun-centred view of celestial objects. And structuralist assumptions about language are just as
commonsensical and just as wrong as Earth-centred conceptions of the heavens” (72-73).


¹ Quentin Vest and I both served as Fulbright lecturers in the English department of the University of Bucharest, Romania, for the
1979-1980 academic year. He taught American literature and I taught psycholinguistics. We became well acquainted and he shared
with me many of the poems he had written. I have searched high and low for the lines quoted here (which I obviously took close note
of upon first reading) but have not been able to find the source poem. I recognized some of the dozen or so published poems that I
found, but none contained these lines. I tried to track Quentin himself down, but have been unable to determine what became of
him after his Fulbright appointment. Everything I have found, whether online or in print publications, including materials from
Longwood University, in Farmville, Virginia, where he was teaching when he received his Fulbright award, predates his time in
Romania.

² Marvita Eley (1917-1996) was my "other mother": the female half of the couple my parents and I moved in with in 1944, the year
after I was born in University Hospital, Ann Arbor, Michigan. Not long after, all four adults having received master's degrees in
English from the University of Michigan, the now-joint families moved to Lincoln, Illinois, to take on a shared assignment to teach
and to radically revise the Lincoln College liberal arts curriculum. After my parents separated when I was five, my mother Madeline
and I continued to live with the Eleys and their son Jonathan, my "cousin-brother" who was eight months older than me. Growing up,
I loved listening to the adults read poetry, including many written by the Eleys, Earle and Marvita. Marvita read widely in the
philosophy and psychology of language. In fact, interactions with her and my father Sidney, who pursued but did not complete a
doctorate in English linguistics, spurred me to become a theoretical psycholinguist. The quoted lines from one of her unpublished
poems, the full text of which I have not been able to find in our Eley-Straight family files, were, to the best of my recollection, shared
with me during one of my many visits home, probably in the mid-'70s or early '80s.


Asymmetries exist between receptive and expressive lexicons, most obviously in children, but adults have
them, too. People “understand words they don’t produce ..., but the reverse—producing words they don’t 
understand—also occurs, and much more frequently than is usually recognized” (83). 


The perpetual discrepancies and dynamic interplay between receptive and expressive processing provide 
much of the power and utility of language as a cognitive tool; to posit the a priori existence or developmental 
emergence of a set of rules that is neutral regarding these competing processual components is not only 
superfluous but actually harmful to our understanding of the “creative aspect of language use”, which 
Chomsky (1966) wrongly attributes to the infinitude of the set of sentences generated by a grammar (86-87).


3.5-8. EXAMPLES 5, 6, 7 & 8 


SOURCES:
• STRAIGHT 1986a: Goals, methods, and theory in language instruction
• HERBERT & STRAIGHT 1989: Compliment-rejection versus compliment-avoidance:
Listener-based versus speaker-based pragmatic strategies
• STRAIGHT 1992: Processing: Comprehension and production
• STRAIGHT 1993: Processualism in linguistic theory and method


Implications of dialectical-processual theory for ...
• Language instruction
    o Development of comprehension skills should be the prime goal of the language learner;
    production skills take care of themselves.
• Listener-based versus speaker-based pragmatic strategies
    o For example, compliments and compliment-responses in the United States of America and
    the Republic of South Africa
• The history of linguistics: the integrity of the “sign” (mutually implied “signifier” and “signified”
that are the same for speaking and listening)
    o Is Saussure to blame? He does occasionally speak this way in the Cours (1916).
• Linguistic theory and method: “Once you've missed the first buttonhole, you'll never manage
to button up!” (Goethe)
    o Set theory’s to blame! Seeing linguistics as a special case of symbolic logic or set theory ruins
    everything.


3.9. EXAMPLE 9: NEUROPSYCHOLOGICAL CORRELATES OF THE RIFE MODEL 


SOURCE:
• STRAIGHT 1999: Central aphasia and the Myth of G


No, Saussure’s not to blame! Actually, we now know (Godel 1957) that the Cours, compiled by his students
after his death, misrepresented his view, which explicitly rejected a bidirectional relationship between a
signifier and a signifié. He had it right: Construing = langue, Saying = parole, and the two reside in different
parts of the neural “speaking circuit” (circuit de parole). 


“Currently, production and comprehension are regarded as quite distinct in accounts of language 
processing. In rejecting this dichotomy, we instead assert that producing and understanding are 
interwoven, and that this interweaving is what enables people to predict themselves and each other” (the 
last two sentences of the abstract to Pickering & Garrod 2013:329, henceforth P&G). 


Constant interaction between receptive and expressive components of language ability is well described 
in P&G’s work. However, in their account of production and comprehension processing a non-directional 
linguistic component (i.e. a “Grammar”) provides the form and content for everything that is “processed”. 
For P&G, as for almost everyone else, the root “knowledge” that makes language processing possible is 
shared between input and output processing and, as for most psycholinguists, takes the form of a black-box 
grammar. The separateness of input and output processing that P&G recognize is thus a matter of 
“performance” rather than “competence”, and the “integration” that they describe, while maintaining 
allegiance to the Myth of G, does capture much of what I have been calling the synergistic aspects of 
“dialectical-processual” analysis. Fortunately, much of the work done by G-believing psycholinguists is rich 
with implications for the details of a D-P model of language. Petra Hendriks’s recent book on Asymmetries 
between language production and comprehension (2014) is an excellent example. 


4. A FINAL LOOK AT THE RIFE MODEL OF LANGUAGE PROCESSES AND THEIR INTERACTION.


4.1. WHY “RIFE”? 


• These letters refer to the four separate but interacting components of language processing.
  o Each component is internally complex, with multiple “strata”, but unidirectional (except for
    component-internal feedback loops, in addition to feedforward connections, between strata).
    (Cf. Lamb, Sullivan, and others familiar to my LACUS audience.)
• The four components (necessarily) emerge (in language acquisition), and are decreasingly
  shared (and increasingly idiosyncratic), in this order.
  o R-I-F-E (R&I = langue, F&E = parole)


4.2. WHY DOES THE RIFE DIAGRAM IN FIGURE 1 READ COUNTER-CLOCKWISE BEGINNING 
IN THE LOWER RIGHT? We are accustomed to the practice of putting the content plane above the form 
plane, on the grounds that content is processed in the “higher” regions of the central nervous system while 
form is processed in the “lower” regions closest to the peripheral components of our nervous system. But: 


• Why does Saying (F&E) appear to the left of Construing (R&I) in the RIFE diagram?

• Why does RIFE read from right to left and proceed in a circular manner?

• Why not make our customary left-to-right linear reading order correspond with early-to-late
temporal order of emergence (RI before FE) and order of increasing idiosyncrasy (with RI (langue)
being a shared “social fact” in Saussure’s terms, while FE (parole) provides individual creativity
and is the source of language change)?




Answer: The diagram reflects the predominantly left-hemispheric neuroanatomy of language: The 
diagram presents the four components of the RIFE Model as if they were superimposed on an image of the 
left hemisphere of the cortex, which is predominant in linguistic processing, though that predominance is 
widely overstated. (The right hemisphere plays a major role in Construing and Saying, but that’s a whole 
nother story.) 


REFERENCES 
(PLUS ADDITIONAL TREATMENTS OF THIS TOPIC BY THE AUTHOR) 


CHOMSKY, A. NOAM. 1965. Aspects of the theory of syntax. MIT Press. 

---. 1966. Cartesian linguistics. Harper and Row. 

GODEL, ROBERT. 1957. Les sources manuscrites du Cours de linguistique générale de F. de Saussure. Genf: 
Droz. 

HENDRIKS, PETRA. 2014. Asymmetries between language production and comprehension. (Studies in 
Theoretical Psycholinguistics 42.) Springer. 

HERBERT, ROBERT K., & H. STEPHEN STRAIGHT. 1989. Compliment-rejection versus compliment- 
avoidance: Listener-based versus speaker-based pragmatic strategies. Language and Communication, 
9: 35-47. 

HJELMSLEV, LOUIS. 1943. Omkring sprogteoriens grundlæggelse [Prolegomena to a theory of language].
Copenhagen: Ejnar Munksgaard. 

HOCKETT, CHARLES F. 1960. The origin of speech. Scientific American 203:88-101. 

MARTINET, ANDRE. 1949. La double articulation linguistique. Travaux du Cercle Linguistique de
Copenhague 5:30-38.

PICKERING, MARTIN J. & SIMON GARROD. 2013. An integrated theory of language production and
comprehension. Behavioral and Brain Sciences 36:329-92. (With extensive Open Peer Commentary).

SAUSSURE, FERDINAND DE. 1916. Cours de linguistique générale. Paris: Plon. 

STRAIGHT, H. STEPHEN. 1971. On representing the encoding/decoding dichotomy in a theory of
idealized linguistic performance. Papers from the Seventh Regional Meeting, Chicago Linguistic Society,
5935-42.

---. 1975. Performance development: How to account for the disparity between comprehension and
production throughout the course of language development. (Abstract) Papers and Reports on Child 
Language Development 10:243-44. 

---. 1976a. Comprehension versus production in linguistic theory. Foundations of Language 14:525-40. 

---. 1976b. Decompositional structure in Yucatec verbs. In Marlys McClaran (ed), Mayan linguistics, 
Volume 1: 189-201. UCLA American Indian Studies Center, Los Angeles. 

---. 1977. Psycholinguistics, a review essay. Canadian Journal of Linguistics 22:169-95. 

---. 1980a. Auditory versus articulatory phonological processes and their development in children. In 
Grace H. Yeni-Komshian, James F. Kavanagh, & Charles A. Ferguson (eds), Child phonology, 
Volume 1: Production, 43-71. Academic Press. 

---. 1980b. Cognitive development and communicative interaction as determinants of the emerging
language abilities of children. (Invited review article) International Journal of Psycholinguistics 7:143-67.

---. 1980c. Structural commonalities between comprehension and production. Revue de Phonétique
Appliquée 55/56:313-16.

---. 1982. The formulation-interpretation circuit: A linguistic motor for the creation of meaning.
Quaderni di Semantica 3:123-128.

---. 1983a. Grammatical rules, psychological reality of. In Rom Harré & Roger Lamb (eds), Encyclopedic
dictionary of psychology, 258. Blackwell’s.

---. 1983b. Review of Thomas A. Perry (ed) (1980) Evidence and argumentation in linguistics.
International Studies in Philosophy 15:105-108.

---. 1984. Language and the cognitive breakthrough at age six. In Carol Larson Thew & Carolyn Echols
Johnson (eds), Proceedings of the Second International Congress for the Study of Child Language,
Volume 2, 220-31. University Press of America.

---. 1986a. Goals, method, and theory in language instruction. In Francis Lowenthal & Fernand 
Vandamme (eds), Pragmatics and education, 183-202. Plenum Press. 

---. 1986b. The importance and irreducibility of the comprehension/production dialectic. In Graham
McGregor (ed), Language for hearers (Language and Communication Library 8), 69-90. Pergamon 
Press. 

---. 1992. Processing: Comprehension and production. In William Bright (ed), International encyclopedia
of linguistics, 3:271-73. Oxford University Press.

---. 1993. Processualism in linguistic theory and method. In Rom Harré & Roy Harris (eds),
Linguistics and philosophy: The controversial interface (Language and Communication Library 13),
199-216. Pergamon Press.

---. 1999. Central aphasia and the Myth of G: Toward a grammar-free linguistics. (Annual Meeting,
Linguistic Association of Canada and the United States (LACUS), Claremont, CA, 28 July-1 August
1998) In Shin Ja J. Hwang & Arle R. Lommel (eds), LACUS Forum XXV, 331-47. LACUS.

---. 2000. Speaking and comprehending: Two underlying systems. (Abstract) In Victor H. Yngve &
Zdzisław Wąsik (eds), Exploring the domain of human-centered linguistics from a hard-science
perspective, 21-22. The School of English, Adam Mickiewicz University, Poznań, Poland.

---. 2010. Review of Richard A. Hudson (2007) Language networks: The new Word Grammar. Studies in
Language 34.2:482-86.


This article was first published at lacus.weebly.com. 


LACUS Forum 41:4.55-76 (2020) 


LACUS Forum is published on-line at lacus.weebly.com, where an electronic copy is provided, free
of charge and with no implied warranty. Its contents are available to you under the terms of the
Creative Commons Attribution-NonCommercial license version 3.0; see
http://creativecommons.org/licenses/by-nc/3.0/.

Copyright © 2020 the Linguistic Association of Canada and the United States. 





ON RHYTHM AND INTONATION IN THE SPEECH OF BARACK OBAMA 


Lucas van Buuren 
University of Amsterdam (retired) / Linguavox, Bloemendaal 


Abstract: The aim of this article is to show that rhythmic (besides intonational) choices have far-reaching 
semantic effects on the textual/deictic, (inter)personal and even on the ideational/representational 
planes, hence on syntax. 

Postulating a Rhythmic Alternation Axiom (RAX) saying no more than two weaker beats between
stronger ones and no more than one after/before pause, this entails: no more than 9 groupings with one
ictus + remiss(es) and hierarchies of such groups within groups. Combined with an Armstrong & Ward
(1926) dots&dashes notation of pitch patterns, this therefore strongly constrains the analysis of the
rhythm and intonation of any English utterance. As a practical exercise for linguists working on speech,
such an analysis is demonstrated on an impromptu reply¹ by President Obama to a question put to him
in a TV interview. Pursuing a so-called form<>meaning approach, the function or meaning of each
rhythmic-intonational option is then stated, in principle, and in so far as within our abilities, in terms of
speaker neurocognitive processes.


Keywords: Barack Obama, rhythm, rhythmicality, intonation, form-meaning, dots&dashes, BTDFPM, 
bostrodac-fypesmo, neurocognitive, evolution. 


1. INTRODUCTION. THE TITLE OF MY ORAL PRESENTATION at the 2014 LACUS Forum was 
Rhythm is Syntax. Since it presupposed a rhythmic-cum-intonational analysis, one is included here in 
the form of a practical exercise on a spoken text. Hence the change in title. 

The text selected is a one-minute answer in spontaneous, colloquial English, by President Obama, 
to a question put to him by Steve Kroft in a 60-minute CBS interview on September 28, 2014, published 
online on YouTube. Rather than analysing the speech of an anonymous British speaker for instance, the 
present text was preferred because the LACUS audience are largely North American, everybody knows 
the speaker and his speech, and being online in the public domain it was immediately available (even 
in Bloemendaal, NL). The deciding factor, however, was that I have always enjoyed listening to 
Barack Obama and admire him also as a person. I can only hope that he does not mind contributing a 
little to linguistic science in this way. 

The present rhythmic/intonational/semantic theory and analysis derive from the work and 
teaching in the 1960s Edinburgh School (David Abercrombie 1964a/b, Michael Halliday 
1961/63/75/85) and also from that of Nel Keijsper (1985) on information theory and Sydney Lamb 
(1999) on neurocognitive linguistics. Its linguistic theory of rhythm was further developed on the basis 
of Cooper&Meyer's (1960) The Rhythmic Structure of Music and Paul Fraisse’s (1974) Psychologie du 
rythme. 


¹ Obama’s reply to Steve Kroft begins at approximately 1 minute and 27 seconds into the YouTube video posted here:
https://www.youtube.com/watch?v=AL5kY]Xiohc




The essence of this article is the line-by-line analysis in section 2 of the form and meaning of
rhythm and intonation in the mp3 recorded text, with which it should be compared. As a first
introduction to the symbols and their meanings used in the analysis in the next section, the following
example (with i standing for I), taking nearly 4 seconds to say, may be briefly discussed. Argumentation
and more detail are given in the (ad hoc) comments on each line and also in Appendix I.


(1) .1/Should +L1ke|some .eggsiand -bacon|th1s -morning.|/.1f/1 -may # 


Ignoring the symbols | and | for the moment, #, //, | and space/blank mark the boundaries, respectively, of
the rhythmic-phonological units: locution, piece, byte, and word. Using < for ‘combining into’, this
example has 12 words < 4 word-constellations or bytes < 2 byte-constellations or pieces < 1
piece-constellation or loc. The corresponding meanings are: 12 concepts < 4 concept-constellations or
thoughts < 2 thought-constellations or ideas < 1 idea-constellation or sententia.
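The nesting of words, bytes, pieces and locutions just described can be checked mechanically. The following is only a minimal sketch, under several assumptions of the editor's: the function names are hypothetical, example (1) is rendered in plain lower-case text with all tone marks and all secondary/tertiary boundary marks left out, and only the three primary boundary symbols #, // and | are interpreted.

```python
def parse_locutions(text):
    """Split a simplified in-line transcription into locutions > pieces > bytes > words."""
    locutions = []
    for loc in text.split("#"):            # '#' closes a locution
        if not loc.strip():
            continue
        pieces = []
        for piece in loc.split("//"):      # '//' closes a piece
            if not piece.strip():
                continue
            # '|' closes a byte; blanks separate the words inside a byte
            pieces.append([b.split() for b in piece.split("|") if b.strip()])
        locutions.append(pieces)
    return locutions


def count_units(locutions):
    """Return the number of (words, bytes, pieces, locutions)."""
    n_pieces = sum(len(loc) for loc in locutions)
    n_bytes = sum(len(piece) for loc in locutions for piece in loc)
    n_words = sum(len(byte) for loc in locutions for piece in loc for byte in piece)
    return n_words, n_bytes, n_pieces, len(locutions)


# Example (1), stripped of tone marks (an assumed plain-text rendering).
EXAMPLE = "i should like|some eggs and bacon|this morning//if i may#"
print(count_units(parse_locutions(EXAMPLE)))  # (12, 4, 2, 1)
```

The printed tuple reproduces the count given in the text: 12 words, 4 bytes, 2 pieces, 1 locution.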

From a processing, neurocognitive (nc) point of view, a speaker first chooses words (i.e. close
juncture units symbolising concepts) for referents he/she ‘perceives’ (e.g. like, , should). Words are then 
combined into ‘thoughts’ or bytes. A byte is a single nc/bodily gesture or rhythm-wave with one S(trong) 
beat/ictus or stress. The reader may find it rather impossible to say the example without (at least) some 
head or eyebrow ‘gesture’ on the S syllables of the S words, i.e. on Like, ba-, mor-, may. Further evidence 
of the unity of brain and body gesturing are the fascinating, often quite energetic, eye-face-hand... 
movements on S words we see from people talking on mobile phones. The non-S syllables/words in 
each byte take a rhythmic offbeat or remiss without accompanying hand or facial gestures. The temporal
juncture between bytes/rhythm-waves is slightly opener than that between the words it contains. 

The semantic status or meaning (i.e. beside the dictionary meaning of each word) of an S word/byte
is ‘pointing’ at, or S(pecification) of its referent (constellation). If an (upward or downward)
pitch-jump is made on the S syllable, as on like in the example, it becomes T(onic). The meaning of a T
word/byte is: ‘not that but this’ i.e. ‘contrast’, i.e. ‘new’. Any S can be made T. The last tonic in a piece
is known as the nuclear T. This can be E(arly), i.e. followed by one or more S bytes, as I should like in
the example, or L(ate). The meaning of E is (new) discovery/revelation within the following S(pecified)
not-new thought(s), i.e. within a not-new idea or situational context. The meaning of L is (new) creation
of a piece-final thought, hence of an idea/context, as when this morning is made tonic. Compare also ES


the -~queen| 1S on .tele.vision.// vs TL /the -queen| 1S 
on-teLe.v1ston._/ , noting, beside the (not-)newness of the final byte, the 
correspondence between the in-line and dots-and-dashes (d&d) notations.

It may already have been noticed that each piece ends in a . for downturn or F(alling) pitch or in a
. for upturn or R(ising) tune. As still to be shown, either symbol is placed high or low depending on
the preceding pitch. F does not mean ‘statement intonation’ as is often said, but rather: alternative 
pieces/ideas neuro-cognitively blocked, i.e. ‘finality’. Nor does R mean ‘question intonation’ but 
simply: not F, i.e. no neural blockage, hence open to alternative ideas. Compare R and -whats| 
your.name./ >=[=aIvs F Jand whats| your .name./ 7=T_T. 

It is also to be noted that three different signs are used to mark T syllables, « for an upward-jump- 
then-fall, - for an upward jump only, and . for a downward jump only. These may be referred to as the 
plus, equal and minus tones. A +tone means committed, an =tone equivalent, uncommitted, and a - 
tone expective, evident choice of word. The bigger the jump, the more committed, uncommitted, 
evident the choice. Some useful examples to practise, with appropriate facial and arm gestures are: 


(L+) #1 ~*Love you./-darLing.# =x. [_. #,(L=)#1”love youy-darling-y 7-7] — 74, 




(L-) #1 .love you./-darling.# =. [_. #.

A +tone is typically accompanied, we then find, by a
downward ‘slicing’ gesture of the hands, eyebrows, etcetera, an =tone by an upward ‘shrugging’ gesture,
and a -tone by an open-handed ‘offering’ gesture.

It is interesting to reflect that most of the prosodic meanings so far are on the textual/deictic plane 
i.e. syntactically (!) linking with other parts of the text, and/or ‘pointing at’ alternative texts and 
elements in the situation. Only the three tone choices, expressing ‘involvement’, are on the 
(inter)personal plane. It is rather difficult to imagine what kind of neural processes they require. 

Another crucial nc point is that a hearer, but generally the speaker as well, does not know the end 
of a locution at the beginning. But at the last word, say after 4 seconds, all the preceding words/referents 
are still ‘present’, i.e. neurocognitively activated. This ‘span of consciousness’ seems to have a maximum 
duration of about 7 seconds, which is also that of a locution, and interestingly, of a single breath group 
or exhalation as well. The explanation must be, it seems, that the neural activation of the first word, 
after being pronounced, and every word after that, is cyclically kept going right up till the end. Although 
rather difficult to visualise one can only assume an extremely complex wheels-within-wheels-within- 
wheels neural activation process. Perhaps each word-cycle is ‘encompassed’ in the next one, each byte- 
cycle in the next one and each piece-cycle in the one after that. 

The final, even more crucial point is: what is rhythm? Everyone knows it, but nobody can explain 
it. To cut a long story short, let me give my definition of human rhythmic control or ‘rhythmicality’ 
and then comment on it. 


Rhythmicality is: the (human) ability to group bodily movements by RAX, within one’s span of 
consciousness, into (hierarchies of) yamBOS U —, TROchee — U, DACtylus — U U, (+)amPHIbrach(+) 
(U)U — U(U), anaPAEST U U — and/or MONE - , and sequences of these. 


This definition is meant to apply to dancing and singing, playing the piano, drumming, swimming the 
crawl, acrobatics..., but specifically to speaking. The mnemonic terms yamBOS (preferred to English 
iAMB), TROchee, etc., may be further memorised as BTDFPM or .bvustrodek~faipes.mo. For more
‘embodiment’ the terms may be further ‘oralised’ as jambvs, trouki, dektilas, (anjamfarba"(an), 
gnapliist, mon. 

The whole definition hangs on RAX, a Rhythmic Alternation aXiom (or perhaps just maXim) 
saying: no more than two weaker beats between stronger ones and no more than one after/before 
pause. This is widely accepted in the literature as a principle rather than a maxim or axiom, to be 
subsequently just as widely ignored. Thus, all pronouncing dictionaries, to my knowledge, fail to mark the 
tertiary beats in words like .edjuxket/n.nll, .edjuxkerfnnl.li (not *.edjusker/n.nlli). Nor are they marked in 
sequences of up to eight so-called ‘unstressed’ ə or shwa syllables as in: .hau! anxkamfa"'ta.bal dat 69+ .waz! 
snod.mints|.trerfnn| tfaaid3.#, again often wrongly stressed by foreign learners of English. 
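RAX as worded above lends itself to a mechanical check. The sketch below is a deliberately simplified illustration, not the author's procedure: it assumes a single tier in which every beat is either stronger (`S`) or weaker (`w`), with `#` standing for a pause, whereas the text treats strength as relative within a hierarchy. The function name `rax_ok` is hypothetical.

```python
import re

# A stretch between pauses breaks RAX if it begins or ends with two or more
# weaker beats, or if it has three or more weaker beats in a row anywhere.
_RAX_VIOLATION = re.compile(r"^w{2,}|w{2,}$|w{3,}")


def rax_ok(beats):
    """Return True if a string of 'S'/'w' beats, with '#' for pauses, obeys RAX."""
    stretches = [s for s in beats.split("#") if s]
    return not any(_RAX_VIOLATION.search(s) for s in stretches)


print(rax_ok("wSwwSw"))  # True: at most two weaker beats between stronger ones
print(rax_ok("SwwwS"))   # False: three weaker beats between stronger ones
print(rax_ok("wwSw"))    # False: two weaker beats directly after a pause
```

On this reading a run of three unstressed syllables is never left flat: one of them has to be promoted to a (tertiary) beat, which is the point made about the dictionary transcriptions.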

It follows from RAX that the number of possible groupings with one ictus + remiss(es) is restricted 
to nine, as indeed (quite superfluously!) listed in the definition, and with it of course (primary) 
boundaries between them. It also follows that, apart from the MONE and the three (+) groupings, 
these are exactly the ones proposed by Greek scholars well over 2,000 years ago. The pluses, by the 
way, before and/or after amFlIba are to allow for patterns like wSww/S| wwswws|wwsw] 
1.e.amfaibo “an| mon| anamfaiba “an| mon| anamfatba “|, still in agreement with RAX. 
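The figure of nine can be verified by enumeration. This is a minimal sketch under one assumption: that a grouping is one ictus (`S`) with at most two remisses (`w`) before it and at most two after it, which is how the nine patterns listed in the definition are read here. The dictionary `NAMES` and the function `groupings` are illustrative names, not part of the original analysis.

```python
from itertools import product

# (remisses before ictus, remisses after ictus) -> mnemonic term from the definition
NAMES = {
    (0, 0): "MONE",
    (1, 0): "yamBOS",
    (0, 1): "TROchee",
    (0, 2): "DACtylus",
    (2, 0): "anaPAEST",
    (1, 1): "amPHIbrach",
    (2, 1): "+amPHIbrach",
    (1, 2): "amPHIbrach+",
    (2, 2): "+amPHIbrach+",
}


def groupings(max_remiss=2):
    """List every pattern of one ictus 'S' with up to max_remiss 'w' on each side."""
    return [
        ("w" * before + "S" + "w" * after, NAMES.get((before, after), "?"))
        for before, after in product(range(max_remiss + 1), repeat=2)
    ]


for pattern, name in groupings():
    print(f"{pattern:<6}{name}")
print(len(groupings()))  # 9
```

Three choices before the ictus times three after it give the nine groupings: the five classical feet, the MONE, and the three (+) variants of the amphibrach.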

Most importantly, two more things follow from RAX, that are well-known to musicians (but not to 
linguists and/or phoneticians!). Firstly, rhythm groups (with two degrees of beat, ictus and remiss(es)) 
may contain subgroups with a secondary ictus and subsubgroups with a tertiary ictus, and so on, 
making up a hierarchy (i.e. tree structure) of ictus-remiss groups within groups. This is quite clear 




when counting time to one’s piano or drum playing: | ONE TWO | THREE FOUR | ONE an | TWO an
| THREE an | FOUR an | ONE y an y | TWO y an y | THREE y an y ...... Secondly, if we find four
degrees of (rhythmic) stress in English (as we did in educationally, the ‘uncomfortable’ example just
given, and also in (1)), that logically entails a three-tier hierarchy of bytes, subbytes and subsubbytes.
And vice versa of course. And primary |, secondary | and tertiary | byte-boundaries as well of course.
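The link between degrees of stress and tiers of grouping can be illustrated with the counting example. The sketch below assumes strictly binary subdivision (each beat split into itself plus one new, weaker offbeat), which is only one of the subdivisions RAX allows, and it labels the four degrees S, M, w and z for strong, medium, weak and minimal. The function name `subdivide` is hypothetical.

```python
LEVELS = "SMwz"  # strong, medium, weak, minimal (zero)


def subdivide(tiers):
    """Build a beat pattern by splitting every beat in two, `tiers` times (0 to 3)."""
    pattern = [0]                      # start from one strongest beat
    for tier in range(1, tiers + 1):
        refined = []
        for degree in pattern:
            refined += [degree, tier]  # each beat gains a new, weaker offbeat
        pattern = refined
    return "".join(LEVELS[d] for d in pattern)


for tiers in range(4):
    print(tiers, subdivide(tiers))
# 0 S
# 1 SM
# 2 SwMw
# 3 SzwzMzwz
```

The last line has the shape of "ONE y an y TWO y an y": three tiers of subdivision yield exactly four degrees of beat, and four degrees cannot arise from fewer than three tiers.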

This has quite far-reaching consequences. On the negative side, it forces us to introduce the rather 
terrible symbols B and D into transcriptions of English (at least when immediately relevant). On the 
positive side it throws up the (possibly vital) question where exactly to draw these boundaries and 
what are its rhythmic and semantic implications. Thus, for educationally (writing ə, rather than syllabic n
and l, this time, and primary/secondary/tertiary BTDFPM for byte-structures), one may hesitate
between: |.edjulskerfana\li| TDm, |.edjulskerfalna.li] TTb, |.edjulkerfona.li] TMp, |eldjuxkerfona'.lil 
MF+m,|.e\djuskerfaina.li] MFb and |.eldjuxkerfona.li| MTb. I would personally reject the last four, 
preferring TDm to TTb because, among other things, it seems to better explain ə having become
syllabic n, l in my speech.

The four degrees of rhythmic stress may conveniently be called S(trong) (not to be identified with 
T(onic)!), M(edium), w(eak) and minimal or z(ero). It is sometimes convenient to subsume w and z as 
u(nstressed), which indeed conforms to traditional practice. Given the stresses within a word like
educationally, different ‘bytings’ would change its rhythm but cannot change its meaning. But the
placement of byte boundaries (in the spaces) between words is very meaningful, as should be obvious 
when one moves some of them in the other examples given so far. 

If a speaker has to make a nc gesture for S/T bytes like Jan ad.minis-tration| .charge| or some .eggs'| 
and ~bacon|, it makes sense to assume he/she first has to activate the classifying M word charge or eggs 
before Specifying the whole compound. This looks like another case of wheels within wheels, so the 
meaning of a secondary or M word/byte (not syllables within a word!) may perhaps be said to be sub- 
specification within the larger one. 

The same argument applies to tertiary or w words/bytes, with some caveats. Their meanings can be 
said to be subsub-specification both in #an*kamfa"'ta.bal| dat 6a" waz! anad... andin #an*kamfa"'to.bal dat 
.09 1 wazi an ad...., but in #.dat! 69 1 .wazian ad... the won that is obligatory-not-optional given the 
one on was. Its meaning is therefore no different from that of z words (no-ictus z byte ruled out): 
subsubsub-specification or neurocognitively ‘given’ or ‘automatic reflex’. In such cases the w/z contrast 
could be said to be neutralised as u(nstressed). What precisely is the function or meaning of minimally 
stressed or z words is one crucial linguistic question still to be investigated. 

Before embarking on the analysis below, it helps to memorise the following symbols for the 
semantic status of each word written above the text, noting also that byte-status follows from these 
word-statuses: L, E, T/ +, =, - and S, M, w, z. 

The semantic status line also shows (sub(sub))-thought | | | divisions, idea // and sententia # divisions, and
whether each idea takes a blocking F or R≠F tune.

Summary of meanings. L = (new) creation: word/byte/idea; E = (new) revelation, discovery:
word/byte within idea; T = new, contrastive word/byte. All three are also given +committed,
=equivalent or -expective tones. S = (not-new, not-given) specified, pointed at word/byte; M =
sub-specified word/byte; w = subsub-specified, almost ‘given’ word/byte; z = subsubsub or minimally
specified word, nc automatic reflex, ‘given’, ‘old’.
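For readers who want to decode the semantic status lines in the analysis, the summary can be kept at hand as a small lookup table. This is a convenience sketch only: the glosses are shortened from the summary above, and the names `STATUS`, `TONE` and `gloss` are illustrative.

```python
# Word/byte statuses, glossed after the summary of meanings.
STATUS = {
    "L": "(new) creation",
    "E": "(new) revelation, discovery",
    "T": "new, contrastive",
    "S": "specified, pointed at (not new, not given)",
    "M": "sub-specified",
    "w": "subsub-specified, almost given",
    "z": "subsubsub-specified, given",
}

# Tones that may accompany L, E and T.
TONE = {"+": "committed", "=": "equivalent", "-": "expective"}


def gloss(symbol):
    """Spell out a status symbol such as 'E+', 'L=' or 'w'."""
    status, tone = symbol[0], symbol[1:]
    text = STATUS[status]
    if tone:
        text = f"{TONE[tone]} {text}"
    return text


print(gloss("L="))  # equivalent (new) creation
print(gloss("E-"))  # expective (new) revelation, discovery
print(gloss("w"))   # subsub-specified, almost given
```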




2. THE TEXT.


The following 21-line transcript is of the one-minute answer (on accompanying mp3 audio) by
President Obama to a question by interviewer Steve Kroft ending in: “... Did you over-promise? Did
you underestimate how difficult this was gonna be?” After stating for each line the
rhythmic-intonational and corresponding semantic structure, some (ad hoc) comparisons and/or comments will
be added.


z I+ | L= RK || semantic status line, directly derived from in-line notation 
l. 1 ~didnt|.over'promise”// TRanscript, with in-line notation directly derived from d&d analysis 
Ne le-) ~~ I  dots&dashes basic analysis (no word boundaries shown) 
am Yatba| .traki! "trouki~ || FitT// rhythmic structure, first “oralised’, then formulaic 
Al: i xdidnt| .over'»promise” |/ FItl/ WN. f. F777 ZI+| L= Ry 
Pl: 1.didnt|.over| -promise./ F| TT/ aa ore. ae | ZE-| MIS F || 


Comments. The best way, perhaps, to grasp the meaning and form of a line is to compare its A(ctual) shape with
P(otential) alternatives, as in A1 and P1 above. P1 takes an F rather than an R tune, meaning alternative
ideas blocked. Instead of an L(ate)= nuclear tonic on overpromise, meaning equivalent/uncommitted new
idea/context creation, we now have an E(arly)- nucleus on I didnt, meaning expective discovery/revelation
encased in the following not-new S-byte specifying referent/context already activated in such a speaker’s brain.
The high(er) pitch on I merely serves to allow a downward jump required for the expective -tone following.

To facilitate grasping and ‘embodying’ rhythmic structures (not an easy matter!) a number of these
have, as an experiment, been written out as ‘oralisations’. The F| TT// formula in P1 may thus be
‘oralised’ as am .fatba | .trvki! -trouki.// Compare this with F|tT// in A1. As seen, capitals are used for
primary, small-caps for secondary and lower-case letters for tertiary BTDFPM groupings; so all byte
dividing lines except | can generally be dispensed with. The oralisations of these formulas are made to
agree (as much as possible) with their status as bodily movements/gestures, hence trouki for T, trvki
for T, traki for t, etc., while the original intonation is kept intact as well.

The word-boundary in the first byte of A1 affects timing but not rhythm: cf. the close vs two
open(er) junctures in annoy, an aim, a name, all three yamBOS. The two-word rhythms could be
specified as +MON instead of yamBOS, thereby expressing not just byte but also word rhythmicity.
But that would complicate things at this stage. As a compromise, I leave spaces for word boundaries in
the oralisations.

As over does not sound M-stressed, it cannot, by RAX, be regarded as z-stressed, only as w-stressed.
So overpromise cannot be analyzed as +F/anamrfatba, only as tT/.traki'trouki. One justification for such 
‘nitpicking’ is to achieve consistency. 


z S| Ts | T+ |S | z E+| wz wi S ®/ 
2. and .1 | xdidnt |, .undelrestimate] ~-how]|, how ~tough|.this! was.gonna! -be U// 





jom bostroukil, troki deektilas | -npn| , jem ‘bps |.maniam | fobs _mon | 


A2: how xtough| .this! was.gonna!-be_//._\ J/.'...1_ ] zZ E+t| wizwsS®*/ 
P2: ~how| "tough -this| was -gonna!.be. V[D7T-[ 7 1 _ T OTt+T=|S |zM! L-‘/ 




Comments. The rhythm M= M= M= FMz¢ of P2 may be pronounced 7m[n= 6m[n= 5m[n= ,m 4f.b,B 
8m[n!4> Compare this with B= mfM# for A2. Also compare the semantic status of each word in A2 
and P2. Their values are derived directly from the in-line notation. 

Distinguishing between secondary M and tertiary w stress sometimes seems questionable or
superfluous. This said, distinguishing M-stressed waz :gvna vs w-stressed waz .gana would seem to
make sense. The other alternatives to A2 are fairly straightforward. P2 is again F-blocked rather than
R-unblocked. The referent of how (i.e. kind and degree of toughness) is selected with commitment and
therefore pronounced as a T+ byte/thought rather than as a z-stressed ‘given’ referent. Similarly, the
referent of the S word/byte this is now vocally pointed at/specified rather than subsub-specified as a
near-given w-word. The L- introduction of be (relating this to tough) creates an evident/expective new
idea/context, whereas S in A2 specifies be as a kind of relationship, i.e. context, already activated in the
speaker’s brain. And so on.

The symbol ... is used to indicate ‘thinking pauses’, a term preferable, I feel, to ‘hesitations’. These, 
like ,,m’s, are outside the rhythmic patterning. It is revealing that even a famously eloquent speaker like 
President Obama does not normally construct syntactic products before pronouncing them, but builds
them up as he goes along. Perhaps syntax is more of a controlling than a generating mechanism.


S| T= |L-  “y S| T+ |wi S| T=) wi le FY 
3. , gam|,  -1| "always| be.lieved_/ -that| , *this| .was! -a] , "long! term] .project.# 


ee | es ee NY ee et 


reMvn|,-mon| "trouvki | ambosv |/ -mon| .~mpn| .mani.mvn| ,”mvn! man | .trourki., # 


A3: .that| *this|.was'a| long! :term|.project.# —[\].{_]~ J_. § S/T +|wi S|T=mwj L— F Ht 
P3: .that! this ~was| a -long | .term|-project./ T.v7T._i[_— oJ wizE+|zS!iM|S Ri 


Comments. A3 is rhythmically: M| M| mM| Mm| T#, P3 is: mB] B] MT// .man! jam ~bvs| jam -bvs! .mtn| 
-trouki// .Consider again the difference between L and E nuclear tonics, crucial to communication. In 
A3 the late nucleus introduces (was a long-term) project as a new thought complex, thereby creating 
a new situational context. In P3 ‘tonic-less’ a long term project is pointed at, specified, as a non-issue
already activated in the speaker’s (not necessarily the hearer’s!) brain. Cf. STEP 4 in App. 1.

The E+ on was in P3 makes its referent (identifying given this with the extant thought complex to
follow) a committed (new) revelation or discovery within that context. As a T word/syllable it is S
stressed. The same word in A3, pronounced waz, can be neither S nor M. Being less than minimally
short it cannot be quaternary z either, only tertiary w, meaning neurocognitively subsub-specified or
‘almost but not quite given’ referent identifying newly selected this with the new thought complex to
come. Clearly the nc processes generating A3 and P3 are very different.

But the most intriguing are the speaker’s non-tonic, but S-stressed 9\2t and 9e., both followed by 
thinking pauses, where A3 omits these pauses and P3 has w/z stressed *\,t and ,> The best way perhaps to 
grasp the meanings of these forms is to consider their T forms first. Thus, the conjunction 7\2t in I know
that (you love me) seems close to a demonstrative pronoun contrasting the factuality of what follows
rather than its reason, time, manner, agent..., which would require why, when, how... When followed by 
a thinking pause indeed it starts off as a demonstrative pronoun, with the following ‘fact’ then coming 
as a kind of apposition to it. (Perhaps thinking pauses can be intentional after all.) When that is
S-stressed, as in A3, it merely points at the not-an-issue factuality of something; with z-stressed dat as in P3
its factuality is already established or given in the speaker’s neurocognitive process. The meaning of the
indefinite article a, roughly ‘one option not yet considered’, can similarly be regarded as new, specified
or given, depending on its T, S or z status.


4. it.wasnt| a short term|-project .# Wan xforut'3am pra'astk. 
am .fatbo4| am ~far ba “| -trovuki.# F’?| Fl TH 





A4: it .wasnt|a ‘short term |-project.4 “_....\-.[—.f# ZI-|zZE+z| S$ F Ht 
P4: it -wasnt|a short! .term|-project.# zT=|zE-! M|S Fy 


Comments. Even very clear and eloquent speakers ‘swallow’ words in spontaneous, informal speech
like this. As indicated by my impressionistic phonetic transcription, I cannot (as a non-American)
quite make out if the speaker says it wasn’t, nor whether it is tonic. The sequence short term sounds
(again to the foreign ear?) much like a single word, having even weaker stress on its second syllable
than in the previous line. But it must be two words considering that the t is longer (and not
American-flapped) than in auto, short earned, sure term. Any listener’s perceptions, it seems, are no more than
approximations, best guesses based on his own linguistic history. Compared to traffic-lights or a
computer language, speech is a rather imprecise means of communication. (Which may be just as well.)


R R 
wi S|} S |w T+ | L= yz TT] EF jz SJ 
5. ho! and|e you-know| ,for| , “indi! .vidual| aymeri*cans~/who .are| ,*struggling| right-now _|/ 


Site eee Ne ee ee Ne 
(m)M|IB |,m,TT|Fm / B,TB/ mani ~.mpn |jam -bvs| ,...// jom.bvs| , ~trouki | jam -bvs_/| 


AS: .for! indil.vidual| aymeri*cans~/ w'! T+ L=Ry an Now 
PS: for_indi'*vidu'all a7me'ricans” + Ts 


A5: mani strouki.trvki| amrfaiba'man”| P5: am .f1ba! troukilman| jam”bvs'ijom:bas~|/ 









Comments. Corrections invited for you know if misunderstood. 

According to RAX, the word Americans must have a w beat on its last syllable. The (pseudo?) 
question then arises if the whole word should be analysed as Fm or Bb, as either or as neither. In 
teaching (British) English rhythm to Italian or Spanish learners one may prefer the Fm pattern. Cf. 
‘bytings’ of educationally on p. 5. 

P5 requires, by RAX, a w stress on the final syllable of individual, as it would if the word were
technological. But no w is perceptible in the recording. This suggests not three but only two
neural-motoric syllable movements between the stronger ones, with concomitant implications for the
concepts of syllable and syllable-elision.

As for technological Americans, I feel that with early S, as in A5, this means ‘a particular type of
American’, with later S, as in P5, ‘Americans with a certain quality’. This may be clearer in the unknown
soldier, an ideal husband or a Chinese restaurant as opposed to later S in my Chinese wife. Such early stressing
is widely known as ‘the’ rhythm rule and regarded as an automatism rather than a semantic option.
Form<>meaning linguists would assume a meaning difference. To keep our semantic formulas
reasonably legible, however, such word features are not inscribed.




T+| S| T= |S |Z WwW! L—- Fy 
6.  xthey|-have] ”every|-reason| to ‘be! im.patient.# 


xmpn| .mvn "dektilas| ~trouki| jam ‘bas! am.faibo.. # M| M| D| T| bF# 


A6:*they|-have] "everyfreason|to ‘be!im.patient. 4 \VL/—77- 777. #It|S|T=|Sizw' LF Ht 
P6:they .have|.every|‘reason|to .be'im-_patient .# “7 /\./..'.— ~ #Z!I—|I-{I+ zw'sky 


Comments. The upturn on have in P6 is merely to allow the following downjump.

Again consider the differences, rhythmic, intonational and semantic, between what the speaker did 
and didn’t say. A6 might be paraphrased as ‘there are lots of reasons why they should get impatient’, P6
as ‘they are impatient and fully justified in feeling that way’. A6 is about people’s feelings, P6 about their
rights. The two ideas/sententias differ not only on the deictic and (inter)personal planes, but also on
the ideational one. Again, they seem hardly derivable from a single syntactic construct. 


TH} z= |] Siz S$ |S]b- 
a *they| should -want|,  ~all| these -things| -solved| to.morrow.# 


A] ee lee ee 
xmpn| jam rbps | “mpn| jam “bvs| -mpn | om.faibo. # M| By, Mj By M] F¥ 


A7: xthey|should "want|-all| these -things|-solved|to.morrow.# T+|ZT=| S|] ZS] S| "4 
P7: theyxshould| *want_¥// xall| these -things|-solved| to-morrow. #ZT+|L+ Ry E+|zS| S|S "4 


Comments. Here the ideational effects of rhythmic and intonational choices may be even clearer. A7 is 
a simple observation implying ‘getting all these things solved by tomorrow is what these folks would 
normally aim for’. P7 paraphrases as or at least implies ‘what they really ought to demand, is that, all of 
it, not just some of it gets solved tomorrow’. Sounding almost like a reproach this would clearly be the 
last thing the speaker wanted to say. It shows up the crucial role of rhythmic and intonational control 
if only to avoid saying the wrong things. 

Much of the neutral/observational effect of A7 is achieved by putting it in the new but expective
context of tomorrow. By making they should want in P7 a single non-final idea, with both should and want
committed choices excluding alternatives, the tone is set for the strong opinion to follow. This is further
strengthened by its commitment to only a single new E+ revelation in an accepted context.


F 
Z + | S | T=| tT | T+ | TF | Ss | L- | 
8. a°r , |1t xdoesnt| -matter| "how|.good| smy| eco’nomicftheories| .are.,// 


ao, - N -] = -| 7 | = | NI... 7 "fT 7 * | 
jomrbas,|am ~faiba| -trouki | "mon| «myn | xmpn|onamrfarba| ~trouki |Ampn..// 


A8: rhow|.good| smy|eco”nomic| theories] .are./ SLIN[7. 7 TL I= [T-|T+)T= |S| L*/ 
P8: show| -good| my ”eco} nomic| theories] -are./ Y[of. 17 i. - 1. || T+|S zT=|E-| S'/ 


Comments. It is difficult to distinguish by ear, and therefore to analyse, the two pitch patterns shown for
how good in A8 and P8. The trick is to say both and then to replace how by a word with a tail after its
stressed syllable, e.g. incredibly. The meaning difference between T=| T-| and T+| S| should then also
be obvious.

Whether to ‘hear’ theories as a trochee or dactylus depends on quite abstract considerations like 
one’s phonological views and linguistic background. Cf. breweries. 

In a form<>meaning approach like the present one it is always assumed that different options have
different motivations or meanings, however subtle. As already suggested under line 5, economic in A8
implies ‘theories about economy’, that in P8 ‘theories of the economic type’. Falsifications/verifications
welcomed, as always.


F 
wiz T+| zzeE=| M!S | 
9. Af? you ~dont| have a "job| ‘right! snow ‘y/ 
_ oN 
monijam ~bys| ana plist] min} ~mpn>// mB] P| MM/ 


Comments. If the E-word job were chosen with (personal) commitment, it would take a +tone, if
chosen as the expective option, a -tone. As it is, the pitch does not, after the upward jump, come down;
therefore job has an =tone, meaning ‘just to mention something’, i.e. an equivalent, non-committal
choice. F tunes with an E(arly) or L(ate) =tone nucleus may conveniently be referred to as =F, the rising
counterpart as =R. The latter are extremely common in non-final clauses, lists and surprised
questions. But =F is common in some languages and dialects, while almost absent in others.

My LACUS 35 article shows that the ‘shoulder-shrugging’ =F pattern is common in the French 
text studied but absent in the same story read by an American actress. vBuuren (2012) contains more 
discussion of the intriguing =F tune. 


F 


Z le | 2 ? Miz S |zLliS | z L+ # 
10. the xonly| , .eco|-nomic} .polil.cy | you "want|to .hear | -is..//,1 m~hired.# 


po Ne ee et ee fe ee Ne 
am ~farbo| ,.trvkil-troukil.trokilman| jam "bps | jam .bvs| -mpn.// , am ~faibo.# 


Comments. The rhythmic formula is F| TTTm| B| B| M/ , F#. The first , is omitted as it seems an
unintended thinking pause, but the later one appears functional/controlled. 
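
A rhythmic formula of this kind can also be taken apart mechanically. The following is a minimal sketch, assuming (as a reading of the notation used here, not a definitive statement of it) that # closes a locution, / closes a piece, | separates primary bytes and , marks a pause. The function name parse_formula and the nested-list layout are hypothetical, chosen only for illustration.

```python
def parse_formula(formula):
    """Split a rhythmic formula into locutions > pieces > bytes.

    Assumed reading of the notation (an illustration only):
    '#' ends a locution, '/' ends a piece, '|' separates primary bytes,
    and ',' marks a pause, which is simply dropped here.
    """
    locutions = []
    for loc in formula.split("#"):
        pieces = []
        for piece in loc.split("/"):
            # Strip pause marks and surrounding whitespace from each byte.
            byte_labels = [b.replace(",", "").strip() for b in piece.split("|")]
            byte_labels = [b for b in byte_labels if b]
            if byte_labels:
                pieces.append(byte_labels)
        if pieces:
            locutions.append(pieces)
    return locutions


# The formula given for line 10: one locution made up of two pieces.
print(parse_formula("F| TTTm| B| B| M/ , F#"))
# [[['F', 'TTTm', 'B', 'B', 'M'], ['F']]]
```

Keeping the three levels as nested lists makes it easy to count bytes per piece or pieces per locution when comparing the actual (A) and possible (P) versions of a line.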

To this non-American listener (economic) palsi was unintelligible until it eventually dawned that it
must be what I know as pvlisi. On further consideration, it seemed that the speaker does not say palsi at
all, but pallsi, i.e. a dactylus with a syllabic l in the middle. It again goes to show that, in the absence of
thorough familiarity with a language, linguists speculate a lot. This may also apply to my analysis of
economic policy (in view of what seems a reduced time delay after the first word) as SM rather than
SS. Rightly or wrongly then, I perceived this as an XY compound on a par with big band, fast lane,
White House, social worker.

As suggested in App. 1, Step 6, it is quite difficult in the absence of a tonic to betoken and/or hear 
early vs late stress in economic. Perhaps the phonological contrast is then neutralised, and thereby the 
semantic one (if any) between qualifying and classifying. 




zT=|z L- gM! z T=) zl #wiz T+ T+]z Lt # 
11.1 "got|a job. # .1| can "pay| my .bills.# .1! can “look| <after| my ~family.# 


| = # -) - | CO ey COUN NN! UN 


jom bosom .bos.# min| jam "bvs|jom .bvs.#.man! jam ~bvs|~troukilam ~fabo. ff 


All: 17got|a job. # .1| can "pay| my .bills.# .1| can “look| after] my ~family.# 
P11: 1.got!a "job .11 can .pay! my "bills 1 ”can| look -after| my ~fami'ly-¥ 


Comments. Tee-tumming A11 yields something like #ftee tum tee .tum# tee "tum tee .tum# tee *tum
*tum *tum# . The speech prosody seems not far removed from a complete musical phrase with the
‘meaning’ premiss-premiss-conclusion. That of P11 #tee ee rtum// tee ee "tum// tee ee "tum tee ee ee#
strikes one as more of a single musical phrase remaining unfinished. It may be relevant to note that
Dunbar (2004) and others hold that human speaking developed out of dancing and singing.

A11 consists of three separate locutions/sententias. The very common T=...L- combinations in the
first two locutions have the effect of a ‘reasonable’ (-expective) answer to one possible (=equivalent)
question: got...? pay...? The third prosodic pattern, introducing a new situation after
three +committed choices, suggests a logical conclusion to the two preceding premisses.

The pattern of P11, a single locution consisting of three uncommitted ‘shrugging’ =F ideas, would 
reflect a completely different neurocognitive process. Randomly selecting two examples and one 
possible speaker-involvement in an extant context may come across as a rather disdainful, even 
indignant rejection of alternatives. 

But the nc processes concerned are in fact far more intricate than this. Line 11 is directly reported
speech from an imaginary member X of the public (in line 10), himself elaborating on an indirectly
reported ‘you’re hired’ from a second imaginary speaker Y. So the question arises whether A11 and/or P11
are reasonable ‘identifications’ with X’s thinking and thereby portrayals of him. Any politician saying
P11 would clearly show disdain for the public and harm his own reputation. President Obama’s version, 
on the other hand, shows delicate powers of identifying with his audience which may go towards 
explaining his appeal as a public speaker. It indicates again that rhythm and intonation (better known 
perhaps as ‘tone of voice’) have not just deictic-textual, but also important interpersonal and indeed 
ideational functions. 


12. .29m|, "but| , -what|,, what 1 | under-stood| *coming| “in_// ~was| -that| , 

13 .re-ver'.sing_//,a ~culture|-here|in*wa'shing.ton./<do!mi.nated| by<special| «interests .} 

14. 39], re-versing.// .ai poclitical| culture // that! was sdolmi.nated| -by| , ~polls.# 

15. .and! <sound| .bites.//-and| , a *twentyfour| .hour| news} .cycle.# 

16. 93a] , re-ver'.sing_// <structural| ~-problems] in .our! exconomy| .that! had been ~building up| .for' , 
twordecades.# 

17. thats gonna -take| "time*¥ thats .gonna take| -more| than a-year.// its .gonna! -take| 

,-more| than *two| xyears.// .1ts |, "gonna!' ‘take| , ~«more] than ~one| "term ‘yf 


~probably| -takes| -more| than ~one]| ”pre'si.dent*# 


Comments. First some points of detail. It is hoped the reader will by now be able to convert
the in-line notations into the corresponding d&d, rhythmic and semantic ones.

My original reason for selecting the present one-minute passage from the 60-minute tv interview
was (with apologies to President Obama) some illustrative rhythmic and, especially, =F intonational
patterns. The =F pattern occurs three times in line 17, making the pieces ending in time, term and
president equivalent ‘just to mention something’ ideas rather than committed ones. It also occurs in
lines 9, 20 and in P11. Unforeseen rhythmic phenomena include strong forms that are generally taken to
be weak, as in lines 3, 5, 12, 15, ‘swallowing’ of syllables or whole words (printed small) in lines 17, 22, and
thirteen thinking pauses in lines 12-17 alone. The words Washington and president are again perceived
by this listener as Mb rather than Tm, dominated (2x) as MF rather than TT. Note also that in line 16 the
numeral two has ‘given’ z-stress, and had been building up (F+, amfaibaran) nicely illustrates one
rhythmic grouping predicted by RAX.

However, the most intriguing feature of lines 12-17 is something that takes us well beyond the 
original plan, raising fundamental questions about the relationship between non-verbal, pre-linguistic 
thinking and its subsequent expression in the language one learnt as a child. Hoping to come back to 
this in a future LACUS paper I can only offer some speculative remarks at this point. [still to be
rewritten]

Lines 12-17, taking 37 seconds in all, consist of 7 locs/sententias or complete(d) ideas, marked off
by #. The fifth and the last take less than two seconds each, the others, except the first, just under 5
seconds each, all well within the span of consciousness. Only the first loc exceeds short-term memory,
taking 9.4 sec after the initial ‘BUT’. If only for that reason, it is probably better analysed as two or
more locs.

Only after repeated listening, not immediately, did this listener realise that the 37 sec passage is not
just a sequence of statements elaborating, extending or enhancing (Halliday 1994:220) each other,
but makes up a single identifying (ibid:122ff) sentence, thus: What I understood (identified) was ‘that... is
going to take time’ (identifier, line 17). The 1.1 sec identifier is then ‘elaborated’ (i.e., ‘in other words’)
with three more clauses in one 2.8 sec locution and finally embellished or ‘enhanced’ by the 1.6 sec final
locution/clause. The subject ‘that’ summarises ‘reversing’ subject clause 1, elaborated by ditto 2 and 3,
each roughly, but not quite, coinciding with locutions of less than 5 seconds.

The obvious question to ask is how a speaker’s syntactic structures of 37 seconds or more, if indeed not
ready-made beforehand, relate to a string of immediate, short-term neurocognitive activations. One
may first consider what went before. Indeed, on closer inspection we see that the whole one-minute
text consists of three parts admirably, and logically, connected to each other. In part A, up to line 12,
replying to the interviewer’s suggestion that he overpromised or has underestimated his project, the
president counters that it takes time, although people understandably want to see immediate benefits.
Part B, lines 12-17, starting with a free-standing ‘BUT’, lists some external obstacles explaining why it
takes time. In part C, starting with a free-standing ‘SO’, the speaker concludes that people’s immediate
concerns remain his priority.

It seems reasonable to assume that by the beginning of line 12, the speaker had a fairly general,
perhaps pre-verbal perception that ‘identification of obstacles encountered was called for’. Just as on
perceiving a certain sky a person may (or may not) project the lexical symbol ‘cloudy’ on it, perceiving a
need for identification may call forth a pseudo-cleft syntactic symbol like ‘what we want to know is...’.
This could be all that is needed to start building up a long and complex syntactic construction like the
present one. Again, syntax appears more generated than generating.

The solution to this conundrum may partly lie in the nature and status of non-verbal thinking (+
pointing and gesturing) before language follows, ontogenetically in the development of babies,
phylogenetically in human evolution, but also in our everyday activities. When driving a bus, playing the
piano or cooking a meal, much, if not all, of one’s thinking is non-verbal. Judging by Frans de Waal’s (1982)
Chimpanzee Politics, many (all?) animals have concepts like good, bad, this, that, taste, touch, hungry,
me, him, her, danger... which must be related to each other to survive. It is what brains are for. Since
language is transmitted through babies, perhaps it was invented by them. At the same time, language, as
suggested by the Sapir-Whorf hypothesis, is to a considerable extent the tail wagging the dog: the
neurocognitive processes of Menomini speakers are quite different from those of speakers of English.
One would like to know how different. And, especially, how similar.


R 
S| z+] Siw] T+!zM{z T+ | T= iz L-'zw | 
18. -so], .a| SO “ij, -try!.to|] ,*keep! in .mind|theim*meidi.ate|,”chal len: eo iof .me_// 





A18: the imsme'di.ate| , "chal'len-ges| in .front! of .me_// Pb| , Mb! Bb// 
P18: tthe! im-mediate] ”challen' ges] in .front| of -me_// mF+| Tm| B| B// 


Comments. Consider (i) jn ~<frant]| av .h33"/htm/dem/as/juu/mii.// vs (ii) jin ~frant! av
.a"/1m/dam/as/ju/tm.//. Compare also (iii) jar ‘lark| -h33"U// with (iv) lat lark a4 U//, etc. A speaker
would say (iii) with her S-stressed, i.e. specifying/pointing at the referent (also with eyes, head, index
finger...), if she, whoever she is, had just entered the scene and the speaker’s awareness. But if he had her
already in mind, it would be given minimal (z or w) stress, as in (iv), and the referent, being ‘given’, would
not be pointed at. If the speaker made her Tonic, he would of course be contrasting the referent with other
candidates in mind. The same arguments apply, mutatis mutandis, to (i) and (ii). In most English
pronouns (except I, they?), auxiliaries and prepositions there is h-dropping and/or vowel reduction in
w/z forms. Listening to w-stressed m1 in line 18 and S-stressed bii in line 2 may help to clarify these
points.

In P18 there is no reason for a w-stress (and therefore a subsub-byte) in F+ amfaibaran immediate, as 
there are only two z-stresses in a row. But in A18, the ‘no more than one z before silence’ axiom entails 
a w, hence Bb. An important task still ahead is to state word structure and then how these forms may 
be ‘adapted’ in longer sequences, mostly by losing w/M stresses. Cf. a first attempt in vBuuren 
2005:116. 


wi Ts]wijl-yTtiw T+ zi Te iz Ley 
19. day; “in| -day; .out.// show]. | xput people] ~back| to »work~// 
ep ee NN ee 


Comments. The word people sounds z-unstressed, its on-screen duration being about 245
milliseconds. The same word in line 20 sounds S-stressed at a (still shortish) duration of about 387 ms.
The reader may also wish to compare the durations of the word preceding people. Further to the
comments under line 18, the question arises here why people should ‘lose’ its word stress, and thereby its full
or literal meaning, to function almost like a pronoun. Perhaps the crucial question to ask in
‘accentuation’ studies is not where the accents go, but where they are omitted. Cf. //,have you! rread|
one flew ‘over! the -cuckoo’s! .nest./) wZT=|zzwizw'L- "/, with a complete sentence as a single
(primary) byte. More examples in vBuuren 1990:98ff.

Both instances of day and the word I also sound unstressed/suppressed/‘given’, though not
maximally so. Hence, reluctantly, w rather than z: an arbitrary decision.


F 
witTt+| T+|]z S | z z E| S I| 
20  .and! <put|, ssteps| in -place| that , can "help|~people’y/ 
a A A a 


man! <mvn|, smvn| jam ~bvs| ma? , jam "bys | ~trouki* |/ mM|,M|B| m?,BiT/ 


Z E=|Z Z Mi Sj) S| Etlz zM SS ‘4 
21  , to "get|in the * middle} ~class*V/ , -and| , <stay|in the .middle!_class..# 


ge Nas te 


Jem”bps|an a °fiba |"mpns|/,-mon| ,mpon| ana .fibo \mon. # BHIM/MMyMy 


Comments. A series of =R pieces as in #-all| you -need| is yapples~// "pears~/ »plums~// and ~bananas.#
makes up the well known ‘listing intonation’. As said, =R piece means ‘equivalent, uncommitted
concept/thought/idea, alternatives unblocked/open’. Compare #rall| you -need| is rapples*/ "pears‘y/
rplums*/ and ~bananas.#. Here the =F tunes have exactly the same meaning except that alternative
thoughts/ideas are (at that moment!) neurocognitively blocked. It therefore does not feel like a list of
related items, but more like a piling up of one idea on top of another as it happens to ‘come to mind’. Cf.
also the comments on lines 11 and 17. Oddly enough, =F patterns seem never to be noted in the literature.
These ‘fruity’ examples may help towards the semantic analysis of lines 20-21. The reader may also wish
to experiment with saying numbers, friends’ names or things to do today. As always, the meaning
differences concerned may be a little subtle, but should not be beyond the reach of any self-respecting
linguist.




REFERENCES 


ABERCROMBIE, DAVID. 1964a. “Syllable quantity and enclitics in English”. In Honour of Daniel Jones, ed.
by D. Abercrombie et al., 216-222. London. Longman.

------ 1964b. “A phonetician’s view of verse structure”. Linguistics 6. 5-13.

BOLINGER, DWIGHT. 1972. “Accent is predictable (if you're a mind reader)”. Language 48. 633-644.

COOPER, GROSVENOR & MEYER, LEONARD B. 1960. The Rhythmic Structure of Music. University of
Chicago Press.

DE WAAL, FRANS. 1982. Chimpanzee Politics. Lon.

DUNBAR, ROBIN. 2004. The Human Story. London. Faber and Faber.

FRAISSE, PAUL. 1974. Psychologie du rythme. Paris. Presses Universitaires de France.

HALLIDAY, MICHAEL A.K. 1961. “Categories of the theory of grammar”. Word 17.

------ 1963. “Intonation in English Grammar”. Transactions of the Philological Society. 143-169.

------ 1975. Learning How to Mean. London. Edward Arnold.

------ 1985. Introduction to Functional Grammar (2nd ed. 1994, 3rd ed. 2004). London. Arnold.

KEIJSPER, CORNELIA E. 1985. Information Structure. Amsterdam. Rodopi.

LAMB, SYDNEY M. 1999. Pathways of the Brain. Amsterdam. Benjamins.

NESPOR, MARINA & VOGEL, IRENE. 1986. Prosodic Phonology. DeGruyter.

VAN BUUREN, LUCAS. 1975. “Phonological hierarchy in English”. Linguistics in the Netherlands
1974-1975, ed. by W. Zonneveld, 70-80. Lisse. De Ridder.

------ 2004. Rhythm and intonation considered neurocognitively. LACUS forum 30:137-146.
lacus.org/volumes/30/301_vanBuurenLucas.pdf

------ 2005. On timing and rhythm - in British English. LACUS forum 31:113-123.
http://www.lacus.org/volumes/31/vanBuuren_l.pdf

------ 2012. Rhythmicality - precondition to language (to appear in LACUS forum 39).
Prepublication on http://www.linguavox.nl.

------ 2013. Rhythm, rhythmicity, rhythmicality. Paper presented to Nederlandse Vereniging
Taalwetenschap. Published on http://www.linguavox.nl.

YESTON, MAURY. 1976. The Stratification of Musical Rhythm. New Haven.




APPENDIX I. PROCEDURE 


The procedure followed in analysing the text seems best presented as a concise practical guideline 
both as explanation and as an example to follow. 

Having obtained a recording, the first thing to do is to write a working transcript of it. Ours is in
ordinary spelling rather than phonetic transcription, but without any punctuation or capitals (hence i
for I), with some ad hoc symbols like , , 9, 3am, and with reduced-size letters for ‘missing’ sounds.

Next, one wants to divide the text into 2-4 second sections or ‘lines’, each corresponding, within 
reason, to a sentence or sentence part. Nowadays we have the benefit of cheap computer programs like 
Sound Studio® allowing on-screen text selection, repetition, comparison, etc. Being unimaginable only a 
few decades ago, this offers wonderful opportunities for young linguists. * 

Taking one line at a time, such as line 2 of the recording 


and 1 didnt underestimate how , how tough this was gonna be 


one first draws two parallel lines under it, as shown, to ‘plot’ a (stylised) Armstrong and Ward (1926) 
pitch notation with one dot or dash under each syllable. Only two levels need be recognised: high(est) 
and low(est) between the lines. 


Quite often the interval between the two lines is roughly an octave, i.e. that between do and
do! in do-re-mi-fa-so-la-ti-do!, but it can be less, or more. Some familiarity with this sol-fa scale
should be sufficient to work on pitch in speech. Anyone new to it may be advised to watch and listen to
the Doe-ray-me-far-sew-lah-tea-dough episode in The Sound of Music. By humming very rapidly through
this scale it is easy to produce and learn to distinguish a variety of rising, falling, rising-falling and
falling-rising mmmm ‘comments’. And level ones as well, of course. The procedure to follow is in nine steps.
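
As a side note on the octave mentioned above: for anyone working from measured fundamental frequencies rather than by ear, the size of a pitch interval can be expressed in semitones with the standard formula 12 * log2(f_high / f_low). The sketch below only illustrates that arithmetic; the function name and the frequency values are hypothetical and are not measurements from the recording.

```python
import math


def interval_in_semitones(f_low_hz, f_high_hz):
    """Return the musical interval between two frequencies, in semitones."""
    return 12 * math.log2(f_high_hz / f_low_hz)


# An octave (do to do!) is a doubling of frequency, i.e. 12 semitones.
print(interval_in_semitones(110.0, 220.0))  # 12.0
```

A range between the two plotted lines that comes out well under or over 12 semitones would correspond to the "less, or more" than an octave noted in the text.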


STEP 1. Plot +Tonic falls. Which words sound the most important? The falls from high to low after an
upward jump, generally no more than two or three in a line, are the easiest to pick out by ear. Even
without any sol-fa knowledge one will hear them as (or make them on!) the most prominent syllables of
the most prominent words. Draw them as dashes (straight lines) from left-high to right-low, thus: \
under the syllable concerned. These are the pitch-accented or T(onic) syllables and words. The
function or meaning of each T-word (not its T-syllable!) is contrast: not ‘that’ but ‘this’, i.e. excluding
alternatives, i.e. new. This is a speaker-choice on the deictic/pointing level. Final or ‘nuclear’ T’s may be
E(arly) as in JOHN loves mary, to reveal a new thought in an existing, not-new idea/context, or L(ate)
as in john loves MARY, to create a new situational context.


* LACUS members are welcome to free copies from the author of the Linguavox phonetic and dots&dashes fonts used in the
article and of the Sound Studio file.


STEP 2. Plot = and - Tonics. But there are (at least) two more, slightly less prominent and less
common, T’s in English. The falling Tonic in Step 1 is always preceded by an upward jump (even
utterance-initially, from a position of vocal rest). All three T’s mean contrast, new or, vice versa, all
contrastive words take a T. The difference is in their tones: +tone, upward jump then fall, means
committed choice of referent/word; =tone, upward jump only (no fall), means equivalent,
uncommitted choice; -tone, downward jump only, means evident, expective choice. Unlike most other
prosodic choices, these are clearly not on the deictic but on the (inter)personal plane. Obviously, an equal
tone must be plotted in the d&d graph as a high level dash (after a jump from low), a minus tone by
a low level dash (after a higher syllable).

So one can hardly miss the +, = and - tonic syllables. What makes it even easier is that (as helpfully
demonstrated by people talking on their phones) +syllables/words are naturally accompanied by
‘slicing’ gestures of hands, head, eyebrows..., =syllables by ‘shrugging’, and -syllables by ‘offering’
gestures. These are of a piece with the speaker’s neurocognitive (nc) gestures. Observe your own facial
and other gestures when saying I LOVE you, darling with +, - or = T on LOVE.


STEP 3. Plot S-beats. Having plotted all the T-syllables in the dots&dashes (d&d) graph, the next step is to
locate all non-T syllables having an S(trong) beat or ‘stress’. S-syllables are potential T’s but lack the
preceding pitch jump. So apart from this (crucial) detail they sound exactly like -tone and =tone syllables
and must be similarly plotted as low or high level dashes, only this time of course on the same level as
(the end of) the preceding syllable. The naturally accompanying gesture is often ‘finger pointing’ + ‘eyegaze’
at referents such as ‘television’, ‘phone’ or ‘here’ in the QUEEN is on television/the phone/here. Observe
that, as in JOHN loves mary mentioned above, the T is here an E(arly) revelation in an existing idea/context.

The meaning of a (non-tonic) S-word/concept like television, and thereby of an S-byte/thought
like is on the phone, is neither ‘new’ like QUEEN (or PRESident) nor ‘given’ like the, is, on. It is merely
Specified (‘pointed at’) as a perception activated in the speaker’s brain. These are perceptions in the
speaker’s ‘here and now’, but not necessarily in the hearer’s, to whom they may come as a surprise. Cf.
the possible surprise value of the HOUSE is on fire, your HUSBAND is in paris (with me).
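
The pitch criteria of Steps 1-3 can be summarised as a small decision rule. The sketch below is only an illustration under stated assumptions: pitch is reduced to the two stylised levels 'high' and 'low', utterance-initial vocal rest is treated as 'low', and the function name classify_strong_syllable is hypothetical. It is not a substitute for listening.

```python
def classify_strong_syllable(prev_end, start, end):
    """Classify a strong syllable from stylised two-level pitch.

    prev_end: level at the end of the preceding syllable ('high' or 'low');
              'low' is assumed utterance-initially (vocal rest).
    start, end: level at the start and at the end of the syllable itself.
    Returns '+', '=' or '-' for the three Tonics, 'S' for a strong beat
    without a preceding pitch jump, or None if Steps 1-3 do not cover it.
    """
    jumped_up = prev_end == "low" and start == "high"
    jumped_down = prev_end == "high" and start == "low"
    if jumped_up and end == "low":
        return "+"  # upward jump, then fall: committed choice
    if jumped_up and end == "high":
        return "="  # upward jump only: equivalent, uncommitted choice
    if jumped_down and end == "low":
        return "-"  # downward jump only: evident, expective choice
    if start == prev_end and end == start:
        return "S"  # level and no jump: Specified, not contrastive
    return None


print(classify_strong_syllable("low", "high", "low"))    # +
print(classify_strong_syllable("high", "high", "high"))  # S
```

The only thing separating an S from a -tone or =tone in this rule is the presence of a jump from the preceding syllable, which mirrors the "(crucial) detail" noted in Step 3.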


STEP 4. Plot primary byte boundaries. A rhythmic grouping, as we see in counting time, cannot have
more than one ictus in it. So since S and T syllables are ictuses, a boundary | has to be drawn between
them for a start, thereby dividing the line into as many (primary) bytes as there are S/T words. The
question is where. The answer is: at the openest juncture (ritardando-accelerando in musical terms) at
the highest grammatical-semantic boundary. Some of the bytes thus created will be BTDFPM
groupings; any longer ones, RAX reminds us, will need sub-division later.

If each word corresponds to an nc concept, the meaning of a byte is conceptual constellation or
thought. As always in our two-pronged form<>meaning approach, forms are not simply identified by
their physical shape, but also by their meanings.
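
The selection rule of Step 4 can be sketched as follows. This is an illustration only: it assumes that the analyst has already annotated every gap between syllables with two numbers, a rank for the grammatical-semantic boundary and a degree of juncture openness. Both scores, and the name primary_boundaries, are hypothetical devices; the text does not prescribe any numerical scale.

```python
def primary_boundaries(stresses, gap_scores):
    """Pick one primary byte boundary between each pair of ictuses.

    stresses:   one label per syllable, e.g. ['z', 'T', 'z', 'S'];
                'S' and 'T' count as ictuses.
    gap_scores: gap_scores[i] describes the gap after syllable i as a tuple
                (boundary_rank, juncture_openness); hypothetical annotations.
    Returns the indices of the gaps chosen as byte boundaries.
    """
    ictuses = [i for i, s in enumerate(stresses) if s in ("S", "T")]
    boundaries = []
    for left, right in zip(ictuses, ictuses[1:]):
        candidates = range(left, right)  # gaps lying between the two ictuses
        # Highest grammatical-semantic boundary first, then openest juncture.
        boundaries.append(max(candidates, key=lambda i: gap_scores[i]))
    return boundaries


# 'the QUEEN is on the phone': T on QUEEN, S on phone (scores are invented).
stresses = ["z", "T", "z", "z", "z", "S"]
gap_scores = [(0, 0), (2, 2), (1, 1), (0, 0), (0, 0)]
print(primary_boundaries(stresses, gap_scores))  # [1]
```

With one boundary per pair of neighbouring ictuses, the line always ends up with as many primary bytes as it has S/T words, as Step 4 requires.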


STEP 5. Plot u syllables. All u(nstressed) syllables in English are (relatively) ‘suppressed’, most clearly
timing-wise and in their vowel, which is frequently reduced to ə (schwa) or even absorbed by
surrounding consonants. In d&d notation u-syllables are represented by dots, high or low. Naturally, those
immediately preceding a -tone should be plotted high and so should all those following an =tone, up till
the next -tone, if any. All other u-syllables may be plotted low. As said, such a notation is ‘stylised’. One
will hear pitches other than highest and lowest and also differences in the range of pitch-jumps, but such
detail is better set aside as showing only degree of speaker commitment to the choices made.
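
The plotting rule for u-syllables can be written out as a short routine. This is a sketch of a literal reading of Step 5, with one labelled assumption: ‘immediately preceding’ is taken to mean the single syllable directly before a -tone. The label set and the function name plot_u_levels are hypothetical.

```python
def plot_u_levels(labels):
    """Assign 'high' or 'low' to every 'u' syllable in a line.

    labels: one label per syllable: 'u' (unstressed), '+', '=', '-' (Tonics)
            or any other stress label such as 'S' or 'M'.
    Assumption: 'immediately preceding a -tone' means the one syllable
    directly before it. Everything else follows the wording of Step 5.
    Returns a dict mapping the index of each u-syllable to its level.
    """
    levels = {}
    after_equal = False  # True from an =tone until the next -tone
    for i, label in enumerate(labels):
        if label == "=":
            after_equal = True
        elif label == "-":
            after_equal = False
        elif label == "u":
            before_minus = i + 1 < len(labels) and labels[i + 1] == "-"
            levels[i] = "high" if (after_equal or before_minus) else "low"
    return levels


print(plot_u_levels(["u", "=", "u", "u", "-", "u"]))
# {0: 'low', 2: 'high', 3: 'high', 5: 'low'}
```

The two-level output matches the stylised character of the d&d graph; finer pitch differences are deliberately ignored, as the text recommends.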


STEP 6. Plot M syllables. [still needs rewriting] This is the most challenging task so far. Having plotted
most syllables there may still be one or two left over with the generally recognised secondary or
M(edium) stress, to be marked with a half-sized dash, high or low. They are clearly different from T
syllables. The problem is that they sound virtually identical to (pitch jump lacking) S-syllables in
loudness, duration and vowel quality. An M is an S, so to speak, that is not a potential T.

Thus in a ‘VEry educating (SuMu) education (MuSu), or better for our purposes, a ‘VEry
busticating (SuMu) bustication (MuSu) (Northern U.S. word meaning ‘break into pieces’), all said on a low
monotone after ~VEry, the difference between S and M is well-nigh impossible to hear for non-English
speakers. This applies even to the compounds (when said on a monotone) in bus conductor (S uMu) Gus
McKenzie’s (M uSu) foster mother’s (Su Mu) custard pudding (Mu Su), although the open(er) juncture
between words helps to hear the rhythmic differences. So clearly, one’s whole knowledge of the language
including its rhythmic/tonic patternings must (as always) be mobilised to ‘hear’ these differences. Taking
some of these examples, an English speaker knows it would be rhythmically bizarre to write and say:


What a *TERrible bus-tication, that the buscon ductor’s MF+|MF/.man' 
am ~faibo-an|mvn| amfi1ba]/ MFT] .man! am farbo*| trouki| rather than mM 
F+|TT// .man! am ~fatboan | trokiltrouki// m B F|.man! jam bos! amf1bo| and 


fos termother bus. Ticated gusmac kenzie’ cus tardpudding. 
MFM# mon| amfaiba| mon# FBTMF#amfatbo| jambvs| trouki|mpn| amfaiba# ratherthan 
TT /troukiltroki/ = TTIM FIT T# troukiltroki| mtn! amfarbo| troki| trouki# 


This indicates that some perceptions, such as M vs S, are largely achieved by elimination or inverse 
evidence indeed requiring thorough familiarity with the language. 

Note that, apart from M vs S, our ‘busticated’ example also yields some secondary and tertiary byte 
dividers in the right places. If the meaning of an S word/byte is Specification, that of an M word (as in 
bus| conductor) is sub-specification (within the larger byte). M stress within an S/T word like 
busticating has no semantic function of its own. 


STEP 7. Plot w syllables. Writing the symbol * for +Tonic, - or - for S, and ° or . for M in a text like hau
Anskamfata.bal dat da4 .waz anod.mints.trerfnn .tfaa“d3.#, that would still leave eight u(nstressed) schwa
syllables between stronger syllables. But RAX tells us this is impossible: some of these must be less
unstressed than others. This necessitates a sub-division of u into w(eak) and minimal or z(ero) stress.
I have therefore marked the w-syllables in the example with a dot, leaving the z-syllables unmarked.
Since all u syllables in the d&d are plotted as dots, those for w syllables should be re-marked
by drawing a small cross-bar across the earlier dot.

In spite of the absence of pitch jumps, etc., the difference between w and z is quite easy to perceive 
or produce, although again not paradigmatically, only in longer stretches. Thus *.hau anskamfo*.tabal 
dat .09 1 waz .an ad.mints-trerfnn .tfaa “d3.# would sound quite un-English to any native speaker. 
Although a w-syllable seems (and should be) marginally more prominent than its z-equivalent, that is 
only so because it is the (neurocognitive) ictus rather than a remiss in a tertiary grouping. What matters 
most then, perceptually, is (extremely subtle!) timing, i.e. the different nc groupings
of syllables/words, with (in this case) tertiary byte boundaries with their opener junctures, in different
places. It appears that, much as we may dislike such complexities, tertiary bytes and byte-dividers are
inevitable not just in detailed analysis of musical, but also of linguistic rhythms.

It also appears from this ‘uncomfortable’ example that sometimes one can choose which syllables 
to make w/z, but sometimes one cannot. If the meaning of a z-word is ‘given’ activation or ‘automatic 
reflex’ or ‘subsubsub-specification’ (wheels within wheels), that of an obligatory w-word is the same, 
but that of an optional w-word may be said to be a nc subsub-specification or ‘preferred given’ activation.


STEP 8. Draw all byte boundaries. Once all M and w syllables have been plotted, RAX demands that any
primary bytes containing these be subdivided by drawing secondary and tertiary byte boundaries. This
can be seen as a continuation or refinement of the primary byte-division in Step 4. A secondary divider
B is to be drawn between every M and M or stronger stress in the same byte, a tertiary division D between
w and every equal or stronger stress in the same byte (e.g. in the ‘comfortable’ examples just given). To
the question ‘where exactly?’, the answer is again: at the highest syntactic-semantic boundary with the
openest juncture.

If the meaning of a word is neurocognitive concept, that of a byte is concept constellation or
thought. Then a secondary byte is a sub-thought and a tertiary byte a subsub-thought. As in the bus
conductor example, the implication is that, neurocognitively, the lower thought is activated before
being encompassed, as a wheel within a wheel, by the higher one.


STEP 9. Plot all piece/locution boundaries. The final step is to ‘upgrade’ one or more byte = boundaries 
to piece + or locution # boundaries. A piece is what is better known as a tune or tonegroup. Its final syllable
is delayed or stretched noticeably more than a byte-final syllable and moreover ends in a slight
downturn in pitch, ~ or * for an F(alling), or an upturn Z or ~ for an R(ising) tune. The up or downturn
should therefore be joined to (the end of) the last dot or dash before converting the / into " or #.


APPENDIX II. BACKGROUND


The intonational analysis demonstrated is essentially that of Lilias Armstrong and Ida Ward (1926).
Their vastly underrated early solution to a still ill-understood phenomenon starts with a so-called
dots-and-dashes notation between two parallel lines representing the speech melody. All speech analysis, I
feel, should start with this. It then divides all speech melodies into falling and rising tunes, with the
meanings (sic!) ‘finality’ and ‘non-finality’ respectively and with one of three pitch-accents on “the
words which are felt by the speaker to be important.” Exactly the same ‘embodied’ form↔meaning
approach was also pursued fifty years later by the first LACUS president Dwight Bolinger in his
crusade (1972ff) against ‘computational’ generative phonology.

The rhythmic approach goes back directly to that of my mentor and teacher David Abercrombie
(1964a,b). Van Buuren (1975) adapted the Abercrombie-Halliday phonological hierarchy to a
rhythmic form↔meaning one, thus: syllable < word↔concept < bit↔thought < piece↔idea <
locution↔sententia. Instead of feet and tonegroups, phonetically more neutral terms like bits (later
bytes) and pieces were advanced. If the phonological word is defined as a close-juncture unit, the
temporal junctures become more and more ‘open’ at word, byte, piece and locution boundaries.

This approach was further developed largely on the basis of Paul Fraisse (1974) Psychologie du
Rythme and Grosvenor Cooper & Leonard B. Meyer (1960) The Rhythmic Structure of Music. From
Fraisse we learn that rhythm is essentially a matter of grouping bodily movements within a ‘présent
psychologique’ (also known as ‘short term memory’ or ‘span of consciousness’) of up to about seven
seconds.

Cooper & Meyer demonstrate that (musical) rhythms are grouped ‘architectonically’ (i.e. both into
constituent ‘trees’ and also sequentially) into “the five basic groupings traditionally associated with
prosody.” Still using an (indispensable!) mnemonic I learnt as a boy from my classics teacher Dr.
Hoekstra, I call these five ‘feet’ yamBOS (rather than English iAMB), TROchee, DACtylus,
amPHIbrach and anaPAEST, pronounced jambvps, trouki, dektilas, amfarba(r), anapiist or BTDFPM for
short. M for MONE mon (rather than ‘moan’) must of course be added, as when a musical bar is taken
up by a single whole-note.

It is clear from C&M that in music one cannot have more than two weaker beats between stronger 
ones and that the groupings (bars, sub-bars, subsub-bars...) make up tree hierarchies. This also 
transpires when counting (or tapping) time for instance to one’s drum or piano-playing, thus: 


ONE TWO ! THREE FOUR |... (repeat)   four quarter notes
ONE an ! TWO an ! THREE an ! FOUR an |... (repeat)   eight eighth notes
ONE y ! an y ' TWO y ! an y ' | THREE y ! an y ' FOUR y ! an y |... (repeat)   sixteen sixteenth notes
ONE TWO | THREE FOUR | ONE | ONE three | ONE two three | ONE //


The byte (rather than bar or foot) in the first line makes up a tree of one TROchee (ictus on ONE,
remiss on THREE) consisting of a primary and a secondary TROchee, thus T(Tt). The tree structure in
the second line is then T(T(Tt)t(Tt)), and so on. These tree-notations become rather complicated (and
space-consuming when pictured), so perhaps another ‘hoekstra oralisation’ is called for. Thus, for line
1: trou kil tro ki |... (using vowel ‘reduction’ to betoken beat reduction), and TT as a written formula.
The rhythmic structure of line 2 would then be trou ki! tri kil tro kil tr ki] (spaces indicating word 
boundaries), or, in writing, TtTt, that of line 3 trou kij tra ki! trt kif tra kil tro kid tro ki! trr kif tra kil... or 
TtttTttttt, and that of line 4 trou kil trv ki | mon] trou kil] dek t1 las| myn// or TTMTDM. 

Observe that the third line exhibits a four tier hierarchy with the quaternary byte dividers adding
a fifth degree of beat. And so on. And on? A five tier hierarchy with no less than six degrees of beat
(1/32 notes) is still within easy reach for musicians, and good drummers may do even better than that. It
seems that in the evolution of the human animal she must have developed an incredible sense of
rhythm. However, speech bytes, unlike musical bars, never seem to sub-divide beyond a three tier
hierarchy with four degrees of beat or (rhythmic) ‘stress’ (except perhaps in counting time as above).
Some more detail is given in my three-tier rhythmic analysis (with ‘trees’ and intensity tracings) of two lines
from Kipling’s Mandalay (vBuuren 2012, 2013).
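
The binary counting scheme above can be sketched computationally. The Python fragment below is only an illustration and not part of the analysis itself: the function name beat_degrees and the coding of beats as integer ranks (0 for the weakest) are assumptions made here. It assumes a bar that is halved repeatedly, as in the first three counting lines, and returns one prominence rank per note; the number of distinct ranks then corresponds to the number of degrees of beat.

```python
def beat_degrees(n_notes: int) -> list[int]:
    """Prominence rank of each note in a bar of n_notes equal notes.

    Assumption: the bar is halved repeatedly, so n_notes is a power of two.
    Rank 0 is the weakest beat; the first note of the bar gets the top rank.
    """
    if n_notes < 1 or n_notes & (n_notes - 1):
        raise ValueError("n_notes must be a power of two")
    top = n_notes.bit_length() - 1  # number of tiers (halvings) in the bar
    ranks = [top]
    for i in range(1, n_notes):
        # A note's rank is the number of times its position divides by two.
        ranks.append((i & -i).bit_length() - 1)
    return ranks


if __name__ == "__main__":
    for n in (4, 8, 16, 32):
        ranks = beat_degrees(n)
        print(n, "notes:", len(set(ranks)), "degrees of beat", ranks)
```

For a bar of four notes the ranks are [2, 0, 1, 0], i.e. ONE strongest, THREE next, TWO and FOUR weakest. Sixteen notes give five distinct ranks and thirty-two notes give six, in line with the four and five tier hierarchies described above.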

Stronger beats tend to be made (and said) with slightly more energy than weaker ones. It also 
appears from Fraisse’s tapping experiments that even at secondary and tertiary byte-boundaries we 
tend to slightly delay (opener junctures!). But some of these finer details may not always be perceptible or 
even effectuated. ‘Stress’ is not a cherry on the cake one can leave out or move about, but an integral 
element of larger temporal patternings in people’s brains. 

Linguists, too, widely accept (v. Nespor and Vogel 1986:282) a Rhythmic
Alternation Principle (RAP) saying ‘no more than two weaker beats between stronger ones (and no
more than one after/before pause)’, which is then blithely ignored in phonological analyses, including
pronouncing dictionaries. All of this leads to postulating it as a Rhythmic Alternation Axiom (RAX).
Like any mathematical axiom, it entails a consistent body of logical conclusions, while showing up
those that flout it. On a practical level, RAX may perhaps be seen as our sniffer-dog revealing any
incongruities.
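
As a rough illustration of RAX in its sniffer-dog role, the Python sketch below checks a pause-bounded stretch of syllables against it. Several things here are assumptions and not taken from the text: stress degrees are coded as integers (for instance z=0, w=1, M=2, S=3), the function name satisfies_rax is hypothetical, and the axiom is applied tier by tier, so that when beats of one degree are counted all weaker degrees are ignored.

```python
def satisfies_rax(levels, max_between=2, max_at_edge=1):
    """Check a pause-bounded sequence of stress degrees against RAX.

    levels: integers, higher = stronger (an assumed coding, e.g. z=0 ... S=3).
    For each degree, beats weaker than it are set aside; then no more than
    max_between beats of that degree may stand between stronger beats, and
    no more than max_at_edge directly after or before a pause.
    """
    tiers = sorted(set(levels))
    for lower in tiers[:-1]:
        visible = [x for x in levels if x >= lower]
        run = 0                 # current run of beats of degree `lower`
        seen_stronger = False   # False until the first stronger beat
        for x in visible:
            if x == lower:
                run += 1
                continue
            limit = max_between if seen_stronger else max_at_edge
            if run > limit:
                return False
            run = 0
            seen_stronger = True
        if run > max_at_edge:   # weaker beats left over before the final pause
            return False
    return True


if __name__ == "__main__":
    S, w, z = 3, 1, 0
    flat = [S] + [z] * 8 + [S]                # eight unstressed in a row
    split = [S, z, z, w, z, z, w, z, z, S]    # some promoted to w
    print(satisfies_rax(flat), satisfies_rax(split))
```

Under these assumptions the first stretch fails and the second passes, which mirrors the reasoning in Step 7: eight equally unstressed syllables between stronger ones are impossible, so some of them must be w rather than z.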

One immediate conclusion is that RAX allows for only nine one-ictus groupings, which, 
revealingly, largely coincide with those proposed millennia ago by Damon, Aristarchos, Aristoxenes, 
Plato and others for poetry, music, dance and speech alike (cf. Fraisse 1974:5-6, Yeston 1976:ch.1).
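
One way to arrive at nine is to assume that a one-ictus grouping may carry zero, one or two weaker beats on either side of its ictus, which gives 3 x 3 possibilities. This reading is an assumption made for the sketch below, not a derivation given in the text. The six names are those of the mnemonic above; the remaining three patterns are simply left unnamed, and the function name one_ictus_groupings is hypothetical.

```python
from itertools import product

# Names from the mnemonic, keyed by (weaker beats before, weaker beats after).
NAMED = {
    (0, 0): "M (mone)",
    (0, 1): "T (trochee)",
    (0, 2): "D (dactylus)",
    (1, 0): "B (yambos)",
    (1, 1): "F (amphibrach)",
    (2, 0): "P (anapaest)",
}


def one_ictus_groupings(max_weak=2):
    """List (pattern, name) pairs; 'S' marks the ictus, 'w' a weaker beat."""
    groupings = []
    for before, after in product(range(max_weak + 1), repeat=2):
        pattern = "w" * before + "S" + "w" * after
        groupings.append((pattern, NAMED.get((before, after), "unnamed here")))
    return groupings


if __name__ == "__main__":
    for pattern, name in one_ictus_groupings():
        print(f"{pattern:<6}{name}")
```

With at most two weaker beats on each side the list has nine entries, six of which carry the names used in this article.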




Putting two and two together this then leads to the definition of rhythmic control or rhythmicality 
given on page 4 of the article. It may be compared with earlier versions in vBuuren (2003:3) and 
subsequent articles. 

So much for the background to the rhythm and intonation approach. It remains to say a little about 
the syntactic and semantic views here subscribed to. As already indicated, much of that also goes back to 
the 1960s ‘(neo-)Firthian’ Edinburgh school, in particular to the (early) teaching and work of Michael 
Halliday (1961, 1963, 1975). It is interesting to reflect that Halliday was later elected LACUS president 
and that many members identify with his embodied functional-systemic approach. 

A second inspiring influence was from my Amsterdam slavicist colleague Nel Keijsper, herself a
follower and admirer of Bolinger’s embodied form↔meaning approach to ‘accentuation’. The
‘meanings’ here distinguished owe much to her unequalled semantic insights. Also, her 1985
Information Structure, an investigation into the Functional Sentence Perspective theories of the
Prague and Leningrad schools, was a major boost to the present enterprise.

A final decisive factor was the influence and encouragement from the Linguistic Association of 
Canada and the United States which I joined only (far too) late in life, and in particular that of yet 
another former LACUS president, Sydney Lamb. His neurocognitive theory of linguistics (Lamb 1999) 
in my view offered a principled approach to at least three ‘awkward’ questions. Firstly, in spite of (still) 
inadequate knowledge of neural processing, linguistic statements in neurocognitive terms are 
verifiable/falsifiable to a considerable extent, and if not yet in practice, at least ultimately in principle. It 
is therefore one if not the only way for linguistics ever to become a ‘hard’ science. Secondly, being 
‘embodied’ by definition, a neurocognitive approach keeps one well away from implausible 
disembodied and computational attempts at accounting for speech and language. Thirdly, rather than 
taking the usual hearer’s perception point of view in matters like ‘meaning’, it seems to require taking a 
speaker's production point of view in the first place - indeed my usual practice since I first started 
teaching English to foreign learners. If my nc statements still seem amateurish and ignorant, leaving 
much to be desired, that is perhaps to be expected at the present state of our (and my!) knowledge. It is 
hoped, however, that readers will regard this as a challenge rather than a deficiency. 


APPENDIX III: 


LvBUUREN 2015, OBAMA RHYTHM/INTONATION↔MEANING ANALYSED
CF. YouTube video, at 1 minute 27 seconds: https://www.youtube.com/watch?v=AL5kYJXiohc


Z I+ | L= *i semantic status line, directly derived from in-line notation 
1. ixdidnt| .over'"promise”// TRanscript, with in-line notation directly derived from d&d analysis 


dots&dashes basic analysis (no word boundaries shown) 
am Yaibs| traki! ~trouki’ ||  FtTy rhythmic structure, first ‘oralised’, then formulaic 


z S| Tt | T+ | S| z E+ | wit zw ! S®/ 
2. ana.i | *didnt| , undelrestimate|-how|, how ~tough| .this! was .gonna! -be_/ 






jom.bvs|troukil ,.troki | xdektilas|.mpn|, jom~bvs|.man! am _— .fabo mpn_]| 
S|} Te | Lb YS | TH] wy S] Te | wl] lL "¥ 

, vam| , «i| “always| be.lieved_/ -that|, *this|. was} -a|, "long! ‘term|.project.# 




mon.mon| r"trouki| jombosv |! -«mon|,~mon|.manimon|,”mon! ‘man | trouki. # 


4. txwant|a short term| -project.# wan foutsum praasik., 
eel oe Ne ee am .faib3"| am fat ba “| -trouki.# F?| F| T# 


wi S| S [wy Tt | L= a2. | E+ |} zs 
5. .hal -and|, you-know| ,-for!, indij.vidual| aymeri'cans”// who .are| ‘struggling| right-now_// 




(m)MB|,m,TT|Em/ BI, TIB/ mon! .mpnijom -bvs|,, // jam .bvs |, ~trouki | jam -bvsv || 


6. ‘they| -have| "every| ~reason|to ~be} im,patient.# 




xmopn| -mon| “dektilas| -trouki| jambas! om.fatbo. # MMDT|bF# 


T+] oz T= | S= Zz S= $ = L- ‘# 
7. ~they| should rwant| ,-all| these -things| ~-solved| to.morrow.# 




‘mon| jam "bps |-mpn| jam “bps| “mpn| omfaibo. # MB,MBMF# 


Z H+] S| T=] IT [T+ | T= | § | L-*y 
8. 9°I,,it xdoe.nt|-matter| “how|.good| *my| eco”nomic| “theories| .are.// 




jomr bos |om ~fatba| .trouki | "mpn| «mvn|mvnjonamrfatba| ~-trouki jmon./ 


w- z Tt= zz E== M B S*¥# 
9. .iff you ~don,| have a "job] ‘right! -now*%/ 




monijam ~bps| ana "plist | emiIn | "mon mBP| MM|/ 




Z T+ | S | M fz S }|z L |S "yf z Lt F# 
10. theronly|, economic} .pol.cy | you’want|to .hear| -is.//,im “hired. #


am ~farbo| ,.trvkil -troukil. trokilman| jam rbps |jam .bvs|.mpn.j/,am ~faibo.}f 




zT=| zL-‘#M z T=] z L'#¢wiz T+ | T+ | z Lt f# 
11. | "got|a job. # .i| can "pay| my wbills.# .i} can ‘look| safter| my ‘family# 


jom "bosom bps. #tamin tj am "bvs| jam  bvs. # «mani jam ~bvs | *trouki| am ~faibo., # 





12. .99m|, "but| , -what| , what i | under-stood| *coming| ‘in_// -was| -that| , 

ast reversing. , a ~oulture| -here| in -waishing.tOn.// <dOimi.nated| bu <speoial| xint.rests. # 
14. .99|, rerversing.// .ai porlitioal| -oulture_// that} was <dojmi.nated| .bu| , polls. # 
15. .and! xsoundl .bites.// and] , a *twen,ufour| -hour| news! .ouole.# 


16. .~99|, rerverlsing_// sstruotural| -prOblems| in .our! exoonomy| .that| had been building up| .for' , 
two ~deoades. ff 


17. .thats gonna -take| "time Y} thats GON take] -more| than a ~vear.|] its gonna, -take| 
, <nore| than *two| ~vears.,/) .its |, "GOMna| take] , .more| than ~one| term} xprob,blu| -takes| 
-more| than ~one| "preis;.d.nt¥ 


S= zT+t= S$-we= T+BzM =z T+ = T= =z L--zw*s 
18. -SO|,-3| ,SO %i[,-tru}.to] ,*keep; in.mind| the im~meidi.ate|,”ohal| len-ges| net me |] 





w -T== w -L- ¥ T+=w-T+t z = T+ =z Le Ff 
19. .dau} "in| ‘dau} .out.// ~hOw|.i | ‘put people] xbaok| to -work~// 


w- T+ = Tt= z S=z ze EF = § FI 
20 .and| ~put| , <steps| in-place] that , can vhelp|-people/ 


.manimon| , <movn| jam.bvs|ma?, jam "bps | ~trouki* // mul], W In? ki) 


zE=|zz M ! S Fi S| Etj~zz M ! S f# 
21 +, to oe in i meen “class , -and| , *stay| in the .middle| -class.# 





jom wanes anam°fiba |\-mvn//,-mon| ,mvn| anam.fiba | .mon.. # BHEM/M MI|FM# 


This article was first published at lacus.weebly.com. 


